Creating Standards 



Studies in Manuscript Cultures 


Edited by 
Michael Friedrich 
Harunaga Isaacson 
Jorg B. Quenzer 


Volume 16 



Creating 

Standards 


Interactions with Arabic Script in 12 Manuscript 
Cultures 


Edited by 
Dmitry Bondarev 
Alessandro Gori 
Lameen Souag 


DE GRUYTER 



ISBN 978-3-11-063498-3 
e-ISBN (PDF) 978-3-11-063906-3 
e-ISBN (EPUB) 978-3-11-063508-9 
ISSN 2365-9696 


BY-NC-ND 


This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 
4.0 License. For details go to http://creativecommons.0rg/licenses/by-nc-nd/4.O/. 

Library of Congress Control Number: 2019935659 

Bibliographic information published by the Deutsche Nationalbibliothek 

The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; 
detailed bibliographic data are available on the Internet at http://dnb.dnb.de. 

© 2019 Dmitry Bondarev, Alessandro Gori, Lameen Souag, published by Walter de Gruyter 
GmbH, Berlin/Boston 

Printing and binding: CPI books GmbH, Leek 


www.degruyter.com 



Contents 


The Editors 

Preface-VII 

Transliteration of Arabic and some Arabic-based Script Graphemes used in this 
Volume (including Persian and Malay) — IX 

Dmitry Bondarev 

Introduction: Orthographic Polyphony in Arabic Script-1 

Paola Orsatti 

Persian Language in Arabic Script: The Formation of the Orthographic 
Standard and the Different Graphic Traditions of Iran in the First Centuries of 
the Islamic Era-39 

Esther-Miriam Wagner 

Writing Judaeo-Arabic-73 

Paolo La Spisa 

Cross Palaeographic Traditions. Some Examples from Old Christian Arabic 
Sources — 93 

Nuria de Castilla 

Uses and Written Practices in Aljamiado Manuscripts-111 

Jan Schmidt 

Flow to write Turkish? The Vagaries of the Arabo-Persian Script in Ottoman- 
Turkish Texts —131 

Branka ivusic 

Developing Consistency in the Absence of Standards — A Manuscript as a 
Melting-Pot of Languages, Religions and Writing Systems —147 

Florian Sobieroj 

Standardisation in Manuscripts written in Sino-Arabic Scripts and xiaojing -177 



VI 


Contents 


Jan van der Putten 

A Collection of Unstandardised Consistencies? The Use of Jawi Script in a Few 
Early Malay Manuscripts from the Moluccas — 217 

Dmitry Bondarev and Nikolay Dobronravin 

Standardisation Tendencies in Kanuri and Hausa Ajami Writings — 237 

Lameen Souag 

Kabyle in Arabic Script: A History without Standardisation — 273 

Alessandro Gori 

Beyond ‘agamT in Ethiopia: a short Note on an Arabic-lslamic Collection of Texts 
written in Ethiopian Script (fiddl) -297 

List of Contributors-313 

Indices-315 



Preface 


This volume grew out of the workshop ‘Creating standards: orthography, script 
and layout in manuscript traditions based on Arabic alphabet’ held at the Centre 
for the Study of Manuscript Cultures, University of Hamburg, on 10-11 October 
2013. The convenors of the workshop (and the two first editors of the volume), 
followed the inspiring initiative of Michael Friedrich to compare standards in 
various manuscript cultures influenced by Arabic script. Our initial - and over- 
ambitious - plan was to (a) identify tendencies of standardisation in orthogra¬ 
phy, script and layout, (b) examine the extent to which these three domains of 
manuscript production are related and (c) delineate factors behind standardisa¬ 
tion processes. During the workshop discussions and later in the process of edi¬ 
torial work, it became increasingly clear that the paths of standardisation in the 
domains of language, orthography and manuscript production are not necessar¬ 
ily connected, and the standards are perceived and measured differently in each 
of the domains. This is directly and indirectly confirmed by the chapters of this 
volume, most of which have more confident conclusions about standardisation 
processes in orthography rather than in other domains of manuscript production. 

This book deals with various aspects of standardisation by stepping outside 
the disciplinary and regional boundaries and providing a typological cross-cul¬ 
tural comparison of standardisation processes in writing traditions influenced 
by Arabic where different cultures, languages and scripts interact. A wide range 
of case studies gives insights into the factors behind uniformity and variation in 
Judaeo-Arabic in Hebrew script (8 ,h -12 lh centuries, Esther-Miriam Wagner), South 
Palestinian Christian Arabic (8 ,h -9 lh centuries, Paolo La Spisa), New Persian (9 th - 
11 th century, Paola Orsatti), Aljamiado of the Spanish Moriscos (15 th —17 th centuries, 
Nuria de Castilla), Ottoman Turkish in the Arabo-Persian script (14 th -19 ,h centu¬ 
ries, Jan Schmidt), a single multilingual Ottoman manuscript (late 16 th century, 
Branka Ivusic), Sino-Arabic writing xiaojing in Northwest China (18 th -20 th centu¬ 
ries, Florian Sobieroj), Malay Jawi script writing in the Moluccas (17 th —19 th cen¬ 
turies, Jan van der Putten ), Kanuri and Hausa Ajami writing (17 th -20 ,h centuries, 
Dmitry Bondarev and Nikolay Dobronravin), the Berber language Kabyle in Algeria 
(19 th -20 th centuries, Lameen Souag), and Ethiopian fidal script used in translitera¬ 
tion of Arabic (19 th -20 th centuries, Alessandro Gori). 

A comparative analysis of pathways of standardisation in the twelve manu¬ 
script cultures addressed in this volume allows for some generalisations, as 
follows. Contact situations do not necessarily lead to the exchange of standardised 
orthographic principles. In many cultures, the co-existence of Standard Arabic and 
non-standardised languages spoken and written in Muslim communities poses a 

3 Open Access. © 2019 Dmitry Bondarev, Alessandro Gori, Lameen Souag, published by De Gruyter. 
[MJHISS This work is licensed under the Creative Commons Attribution-NonCommercial- 
NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110639063-202 
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Preface 


paradox: such languages are profoundly influenced by Arabic, but their orthogra¬ 
phies are not modelled on the principle of standardisation. This apparent paradox 
is resolved by the prediction that standards in orthography - one of the domains of 
manuscript culture - are conceptually different from standards in other domains, 
such as format, layout and script. Each domain of manuscript culture develops 
microsystems of standardisation and different domains have different ‘areas’ of 
uniformity and standardisation in a given manuscript culture. Thus, a general ten¬ 
dency observable at the level of physical features of manuscript production is that 
layout and script types tend to be unified, irrespective of orthographic norms and, 
vice versa, orthographic norms develop irrespective of norms applied to physical 
domains of manuscript production. 

The editorial process took us longer than we planned, and we are immensely 
grateful to the contributors for their patience and trust in our collaborative work. 
Our gratitude goes to all the presenters and participants of the October 2013 
workshop for the inspiring exchange of ideas many of which have materialised 
in this volume. It was a great pleasure to work with Carl Carter, Maya Kiesselbach 
and Joe McIntyre who meticulously copy-edited most of the contributions. We 
thank you sincerely for your most helpful corrections, remarks and suggestions. 
Our appreciation goes to the anonymous reviewers for their constructive com¬ 
ments on various parts of the volume. We owe an immeasurable debt to Cosima 
Schwarke who has been a guiding lantern during our long journey. It is thanks 
to your day-to-day support in all editorial matters that this book finally sees the 
light. Our appreciation also goes to Astrid Kajsa Nylander who greatly assisted 
with the final layout of the book. We are most grateful to the editors of the series 
Studies in Manuscript Cultures for taking an interest in this volume proposal. This 
publication project would not have been possible without the financial support 
of the German Research Foundation (DFG) which funds the Sonderforschungsbe- 
reich 950 Manuscript Cultures in Asia, Africa, and Europe. 


Hamburg, Copenhagen, Paris. September 2018 
Dmitry Bondarev, Alessandro Gori, Lameen Souag 
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Dmitry Bondarev 

Introduction: Orthographic Polyphony in 
Arabic Script 


[...] standardization emerges as a complex process whose many facets (linguistic, social, 
cultural, educational, political) we still do not fully understand, and which warrant further 
research from comparative, case-study and interdisciplinary perspectives. (Deumert and 
Vandenbussche 2003a, 11) 

Printing gave rise to a distinct literate culture, and the earlier scribal culture had many of 
the same limitations often attributed to oral culture: individual copyists produced texts 
with idiosyncratic formats, conventions and mistakes, whereas printing allowed a large 
number of identical texts. (Barton 1994,124) 

Manuscripts originate from literacy practices embedded in numerous social 
domains, such as education, administration, religion and trade. Expressed in 
spoken and written languages, various social activities prompt the development 
of organising principles and structures, which in turn serve as models for the 
agents and participants of literacy practices. The degree to which such organi¬ 
sing models may develop varies from lax to strict. The strict models of literacy 
practices are usually regulated by sets of standards. When a certain literacy 
event takes place - writing a letter, copying a poem or commenting on a canoni¬ 
cal text, for example - then the regulatory normative patterns (or their absence) 
may variously be reflected in the resultant manuscript. The size and form of the 
manuscript is one such indexical feature, the layout another, and the type and 
style of script and spelling conventions are yet another feature indicative of the 
degree of standardisation imposed on the scribe. If such features are examined 
in relation to each other rather than separately, and if the patterns of their relati¬ 
onship in one manuscript culture are compared to the patterns in other cultures, 
we may learn a great deal about the underlying forces of literacy and specifically 
about language in its relation to the manuscript medium. 

A holistic comparison of the sort in line with the stance of the first quote in 
this chapter would shift manuscript studies to a previously unexplored vantage 
point. We would not only understand which components of a manuscript culture 
were historically more impervious to stabilisation and standardisation, and 
which disfavoured variation, but we would also come close to big WHY questions. 


I am most grateful to Michael Friedrich and Lameen Souag for their constructive comments on 
earlier drafts. 


3 Open Access. © 2019 Dmitry Bondarev, published by De Gruytei ( r 9^3113113 This work is licensed 
under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110639063-001 
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Why do some cultural models lead to totalitarian types of standardisation of 
writing, such as Western societies? Why do others keep on with limited stan¬ 
dardisation - strict in one social domain and lax in another? Such would be the 
cultures with fixed orthography and regulated reading of codified texts (e.g. the 
Qur’an) and non-codified writing practices in the languages or language varieties 
other than those of the codified texts. And why do the other cultural models exist 
without any standardisation? Or do such cultures exist at all? 

However promising a holistic comparison might sound, we are still far from 
that illuminating vantage point. It might in principle be feasible to carry out a 
study on the multiplicity of factors behind standardisation (or failure thereof) 
in one manuscript culture, but a comparative study of several cultures seems an 
enormous task. This is because in order to make the comparison typologically 
valid, we need to identify social domains related to manuscript production for 
each culture (by no means static), then study norms, prescriptions and codes in 
each identified domain. Only then will we arrive at substantiated observations 
about the standardisation factors in the history of a manuscript culture. However, 
to the best of my knowledge, no studies have tried to treat all the possible factors 
of standardisation in one manuscript culture holistically yet, let alone compa¬ 
rative cross-cultural studies. This is not surprising, actually. In Western socie¬ 
ties, the (positive) notion of a standard developed at the time of print, long after 
manuscripts ceased to be the prime medium of literacy practices. The manuscript 
age was seen as a pre-standard stage in the history of the development of written 
languages, this history culminating in standardised print culture. 1 So the study of 
standardisation was in the areas of human activity where it was expected, which 
excluded manuscripts. 

The long history of successful attempts to eliminate variation in spoken and, 
especially, written European languages and to promote the primacy of a stan¬ 
dard led to scholarly frameworks with dismissive attitudes to variation in written 
texts (manuscripts). Up until sometime in the middle of the 20 th century, lingu¬ 
ists considered textual variation in manuscripts as an uncomfortable situation 
resulting from ‘idiosyncratic formats, conventions and mistakes’, as expressed 
in the second quote in this chapter. Recognising variation as an important 
factor in understanding language in its spoken and written form and in literacy 
practices in general was a novelty in some disciplines in the 1970s and 80s, only 
recently gaining momentum in linguistics, sociolinguistics, literacy studies and 


1 See the discussion inter alia in Eisenstein 1979, 1983, Williams 1981, Stubbs 1980, Bullough 
1991, Barton 1994, Linn and McLelland 2002, Agha 2007, Sebba 2007, Stenroos 2018 and Van der 
Horst 2018. 
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manuscript studies. 2 It may seem a truism that understanding variation is essen¬ 
tial for understanding what kills it, namely standardisation. But despite the deve¬ 
lopment of variation-oriented studies (and thus concerned with standardisation 
in one or another way) in linguistics, sociolinguistics, philology and generally in 
manuscript studies, little has been done so far to approach variation/standardi- 
sation phenomena holistically, involving interdisciplinary dialogue. 3 


1 Standardisation: why sociolinguistics? 

A manuscript is a meeting place of different cultural practices and domains. 
Some of these practices can be recognised visually in the manuscript’s size, form 
and the material it is made of (all of these roughly corresponding to the crafts 
of bookmaking) or in the layout, script type and style, orthography and lan¬ 
guage (the scribal domain). Understandably, there has been a division of labour 
between specific disciplines dealing with these different sociocultural domains. 
The material, size, form and layout of manuscripts are common fields of investi¬ 
gation for codicologists, script type and style are in the scope of palaeography, 
and orthography and language are treated by philology and (socio)linguistics. 
It is instructive to learn that of all the disciplines, the only one that has develo¬ 
ped a systematic approach to the study of standards (and the dichotomy between 
standard and variation) is sociolinguistics. That is not to say that codicology, 
palaeography and philology are not concerned with standardisation tendencies. 


2 In linguistics, the quest for comparative cross-cultural research into standardisation started 
with Jespersen (1925, 46) and was coined ‘comparative standardology’ by foseph (1987, 13) and 
resulted in a comprehensive comparative work on Germanic languages (Deumert and Vanden- 
bussche 2003a) (see especially Deumert and Vandenbussche 2003b, 1), which is discussed in the 
following sections. For manuscript studies, see Sobieroj 2016 and his overview of a recent trend 
in Arabic studies to ‘place variance itself in the focus of research’ (Sobieroj 2016, 2). 

3 One significant exception is a collection of articles edited by fennifer Cromwell and Eitan 
Grossman (2018), Scribal Repertoires in Egypt from the New Kingdom to the Early Islamic Period. 
This work deserves special attention. With their focus on the exact opposite of the subject of 
this volume, there is a shared goal to study the connected phenomena - ‘their’ variation and 
‘our’ standardisation - in the complex linguistic and extra-linguistic dimensions. Although the 
cultural and geographic scope of Cromwell and Grossman 2018 is Egypt, their book is in essence 
a cross-cultural and typologically oriented comparative study since it covers cultures in Egypt 
which co-existed or replaced one another in the course of four millennia, while the typological 
frame is given through the lens of historical sociolinguistics informed by European philology 
(especially studies of pre-modern English). 
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though. Standard(ised) practices are by all means mentioned or studied in litera¬ 
ture from these disciplines (inter alia Beit-Arie 1992, 2017, George 2007, Deroche 
2006, Gacek 2009, Pollock et al. 2015, etc.). In these fields, the word ‘standard’ 
is used in many different senses within its semantic domain, ranging from a 
source of authority to a level of achievement. 4 In sociolinguistics, however (and 
more generally, in linguistics), it has a narrower scope of ‘language codification 
leading to elimination of variation’ (more on this definition below). The causes 
and consequences of language codification have been discussed in various bran¬ 
ches of (socio)linguistics across major topics such as social and linguistic identity 
(Milroy and Milroy 1992, Agha 2007), language varieties and dialects (Trudgill 
1979, Biber and Finegan 1994, Ferguson 1994), language variation and change 
(Romain 1982, Milroy and Milroy 1985, Chambers and Schilling 2013), the distinc¬ 
tion between speech and writing (Biber 1995, Biber and Conrad 2009, Lillis 2013, 
Lillis and McKinney 2013), the development of writing systems and language 
planning (Fishman 1974, Grenoble and Whaley 2006, Sebba 2007, 2009), and 
the sociolinguistics of reading and writing (Stubbs 1980, Street 1993, Blommaert 
2005). It is a matter of course that without writing there would not have been any 
manuscript cultures, so the concept of writing seems to be the most natural node 
connecting manuscript studies and the discussion of standardisation in sociolin¬ 
guistics. These research fields are not overtly connected, however. The following 
aims to reveal some interdisciplinary bridges. 

Sociolinguistics emerged as a subfield of linguistics in the late 1960s/1970s, 
at a time when written language was only marginally considered worthy of lingu¬ 
istic investigation (Barton 1994, 2007, Lillis and McKinney 2013, Stenroos 2018). 
Naturally, there was not much discussion about writing, let alone about writing in 
the ‘manuscript age’. 5 This changed a decade later with the ever-growing anthro¬ 
pological and sociolinguistic enquiry into literacy and the relationship between 
speech and writing (Scribner and Cole 1981, Stubbs 1980, Goody 1987) and with 
the formation of historical sociolinguistics, which focuses on extra-linguistic 
factors as a way of explaining language change (Weinreich et al. 1968, Romain 
1982, Mattheier 1988). 


4 An illuminating short overview of the historical and semantic scope of the term in English has 
been provided by Raymond Williams, one of the founding figures in Cultural Studies, in his vo¬ 
cabulary of culture and society (1983, 296-99). Also see Williams 1981 (esp. 87-118) on standards 
and standardisation in print cultures. 

5 Mesthrie et al. (2013, 27) acknowledge that ‘the study of writing as a social practice is a rela¬ 
tively new interest in sociolinguistics’. 
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Since then, the sociolinguistics of writing and literacy studies has evolved into 
New Literacy Studies (NLS), which pays great attention to the social and material 
context and modes of writing - and thus has the potential to extend its interest 
to manuscripts as well (Barton 1994, 2007, Blommaert 2005,2008, Lillis 2013, Juf- 
fermans et al. 2014, Weth and Juffermans 2018). NLS’s dynamic approach to the 
codification of written forms of language explores non-unidirectional dimensions 
in the development of literacy practices which counteract the ‘tyranny of writing’. 

In the meantime, historical sociolinguistics has grown into a diverse 
field, bringing together linguists, philologists and historians who work with 
manuscripts. The increased interest in manuscripts, not only as a mine of data, 
but as a subject of study in its own right, was prompted by the drive to make 
the ‘best use of bad data’ (Labov 1994,11, referring to written artefacts with their 
scarcity of background information and their texts skewed to the registers of the 
educated). The result was a number of publications that were helpful across dis¬ 
ciplines (Hernandez-Campoy et al. 2012, Langer et al. 2012, Wagner et al. 2013b; 
Cromwell and Grossmann 2018). 

Thanks to the interdisciplinary mergers, the field of sociolinguistics seen as 
a whole (with all its interrelated subdisciplines) seems to offer a set of terms and 
approaches relevant to the question of standardisation in manuscript cultures. 


2 ‘Comparative standardology’ 

The written and spoken counterparts of language as topics of study have gone 
hand in hand in the history of European linguistics, with one hand pulling harder 
than the other at different points on this journey (see Barton 1994, 2007 for an 
overview). The major sociolinguistic concepts about standardisation grew from 
the study of spoken languages, which was the initial focus of the discipline. 
However, the written counterpart came onto the scene at a very early stage. 

As said before, standardisation in sociolinguistic terms is generally under¬ 
stood as language codification leading to elimination of variation. This definition 
is a hybrid one, uniting both wider and narrower senses. In a wider sense, ‘[sjtan- 
dardisation refers to the process by which a language has been codified in some 
way’ (Wardhough 2010, 31). In a narrow sense, ‘the process of language standar¬ 
disation involves the suppression of optional variability in language’ (Milroy and 
Milroy 1999, 6, emphasis in the original). 6 The ‘process’ is key here. Many authors 


6 A more functional and explicitly socially oriented definition has been provided by Garvin and 
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try to overcome ‘the somewhat teleological orientation of traditional standardisation 
models’ (Deumert and Vandenbussche 2003c, 457). The process may take unexpec¬ 
ted turns and lead to de-standardisation, to cycles and to intricate relations between 
standards, sub-standards and non-standards against the backdrop of sociocultu¬ 
ral domains - a complex which prompts the notion of ‘standard language cultu¬ 
res’ (Milroy 1999). The question of language standardisation received particularly 
comprehensive treatment in Germanic (socio)linguistics. A systematic comparative 
approach to the study of linguistic and extra-linguistic factors has been elabora¬ 
ted in Deumert and Vandenbussche (2003a). Several instructive points stemming 
from this study seem promising for an integrated analysis of related phenomena in 
manuscript cultures. Deumert and Vandenbussche (2003a,b,c), develop Haugen’s 
(1966a,b) four-way model into a comprehensive framework for what they call ‘com¬ 
parative standardology’ (following Joseph 1987,13, cited in Deumert and Vanden¬ 
bussche 2003b, 1). 

Haugen’s grid of standard language development consists of (1) norm selection, 
(2) norm codification, (3) norm implementation and (4) norm elaboration (Haugen 
1966a,b summarised in Deumert and Vandenbussche 2003b, 4). What is especially 
interesting for the study of manuscripts is that a written variety of the language 
is typically considered a key agent of standardisation at all four stages: ‘it is [a] 
significant and probably crucial requirement for a standard language to be written’ 
(Haugen 1972, 246, cited in Deumert and Vandenbussche 2003b, 3). However, it is 
not necessarily the case that a written standard code initially selected as a model 
will be carried over to the codification phase associated with the creation of gram¬ 
mars and dictionaries which fix the norms in prescriptive mode. Initial written 
standards might be lost, as was the case with Old Frisian (Hoekstra 2003) and Low 
Middle German (Langer 2003). Linguistic competition between different available 
norms may lead to the suppression of one norm and elevation of another, resul¬ 
ting in the co-existence of standard (written) languages or language varieties, each 
covering different social domains and having a suppressive or enriching influence 
on each other. 7 Complex situations of contact between standard and non-standard 


Mathiot (1960, 783, cited in Mesthrie et al. 2013, 20): ‘codified form of a language, accepted by, 
and serving as a model to, a larger speech community’. Romain (2000, 14) defines a standard 
language as ‘a variety that has been deliberately codified so that it varies minimally in linguistic 
form but is maximally elaborated in function’. 

7 See Mattheier 2003 on the co-existence of Latin (in the clerical and literary domain), the 
written Alemannic dialect (between the 9 th and the 13 th centuries in a narrow domain of court 
poetry and epic) and other written vernaculars developing from the 11 th to the 15 th century and 
resulting in the formation and co-existence of four main uniform written linguistic norms, 
namely East Upper German (Bavarian-Austrian), West Upper German (Alemannic), East Middle 
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varieties may result in de-standardisation 8 and the emergence of new regional or 
local norms through the convergence of standard and non-standard norms or the 
convergence of non-standard varieties or through divergence, for example that of 
the Scottish regional norm from the Northern English dialect (Dossena 2003). In 
the process of divergence or convergence, various diglossic situations may arise, 
such as ‘standard/dialect diglossia’ (Deumert and Vandenbussche 2003c, 7), or 
‘medial diglossia’ (writing in one language and speaking in the other [Mattheier 
2003, 212]; Liipke 2011 calls the same phenomenon ‘exographia’). 

In summary, the ‘comparative standardology’ framework provides a useful 
interdisciplinary set of concepts for the study of standardisation, as follows: norm 
selection, codification, implementation and elaboration; co-existence of local 
norms; competition; loss (and thus vestiges); (dis)continuity; de-standardisation; 
centripetal and centrifugal cycles; interaction and contact; and divergence and 
convergence. 

The important notion of ‘standard language culture’ as well as the distinc¬ 
tion between a standard language sensu strictu and the process of standardisation 
developed within this framework invites connections with the study of manuscript 
cultures. In recent comparative studies on linguistic variation and change in 
manuscript traditions, the notion of a standard was seen critically as being too 
teleological and unidirectional and hardly applicable to the multilingual environ¬ 
ment of earlier manuscript cultures with their variation of registers and linguistic 
codes. Thus, in her study of late Middle English scribal practices, Merja Stenroos 
observes that ‘terms such as “standard” and “standardisation” may not be very 
useful when applied to fifteenth-century materials’ and that there are cases which 
‘do not fit into a unidirectional view of the standardisation process’ (2013, 160). 
The ‘comparative standardology’ approach helps in this respect as it offers epis¬ 
temological scope to include all the cases in standardisation studies that are not 
covered by the models of standard languages sensu strictu. 


3 Written language and orthography 

Even though spoken standards typically develop hand in hand with their written 
counterparts, standardisation of writing differs considerably from standardisa- 


German (Saxonian) and West Middle German (Franconian). 

8 See Greenberg 1986 and Ferguson 1988 on ‘standardisation cycles’ understood as ‘a succession 
of periods of focus with standardization and periods of diffusion with dialect differentiation’ (Fer¬ 
guson 1988,121). 
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tion of speech. The ‘writing system [...] is relatively easily standardised’, whereas 
‘absolute standardisation of a spoken language is never achieved’ (Milroy and 
Milroy 1999,19). This is not surprising given the difference between the linguistics 
structures meant to be covered systematically by the writing system and the struc¬ 
tures covered by the spoken language. The scope of writing systems is limited to 
a countable number of items: smaller numbers in phonographic systems (from 
phonemes to syllables) and much larger, but still finite ones in phono-logogra- 
phic systems (from phonemes to words). The scope of language is a nearly infinite 
number of grammatical structures and variant forms. This difference is empiri¬ 
cally observable in various alignment scenarios whereby standardisation of lin¬ 
guistic structures may develop without standardisation of orthography or at a 
different pace to it, be they interrelated (in some societies) or unrelated. Thus, 
English orthography has changed very little since the codification activity of the 
18 th -century prescriptivists, but codification of the spoken language has been less 
successful (Milroy and Milroy 1999,28; Agha 2007 [chapter 4], Sebba 2007). In the 
case of Persian, the orthography was standardised together with the emergence 
of New Persian and its standardisation into Classical Persian (Perry 2012, Orsat- 
ti). 9 And in the case of Ottoman Turkish, ‘there was no standard form for written 
Turkish and no standardised spelling until the 20 th century’ (Darling 2012, 174; 
Schmidt). Genre-specific standards in linguistic structures developed in episto¬ 
lary writing in Judaeo-Arabic, featuring strong spelling variation (Wagner 2010, 
2013, Wagner). Standard Spoken Tamil is reported not to have a standard ortho¬ 
graphy counterpart (Schieffman 1998). 

A writing system in its visual graphic representation is the interface between 
linguistic structures and manuscripts. 10 Language is converted into manuscripts 
through a graphemic code, and it is through this code that linguistic structures 
are retrieved from manuscripts. This trivial remark is meant to remind us that 
orthography as a set of spelling conventions (be it strict or lax) is inseparable from 
written artefacts. So, the study of orthography should be intrinsic to research on 


9 The authors of this volume are indicated in italics. 

10 I avoid the simple dichotomy of speech vs manuscript (writing) because many linguistic 
structures are predominantly realised in writing and many speech discourses are not meant for 
writing. Potentially, any linguistic structure can be written down, but not all structures are feasi¬ 
ble in spoken language. The study of the relationship between speech and writing has a long and 
rich history spanning more than half a century. For more recent treatment and an overview of the 
topic, see Barton 2007, Biber and Conrad 2009, Lillis 2013. In historical sociolinguistics, a pro¬ 
ductive approach is to treat written data in historical documents as ‘text language’ (Fleischman 
2000) or ‘manuscript language’ (Stenroos 2018). 
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manuscript cultures, and insights from sociolinguistics are equally helpful in this 
respect. 11 

The notion of orthography has two terminological poles. The first defines 
orthography in the narrow sense as ‘the standardized variety of a given, language- 
specific writing system’ (Coulmas 2003, 35), the definition very closely linked 
to the word’s etymology (‘correct writing’, German Rechtschreibung or Russian 
pravopisanie [upaBOiiucaHue], etc.). The second definition has a wider scope: ‘the 
set of conventions for writing words of the language’, which leads to ‘the notion 
of orthography as social practice’ (Sebba 2007,10-11,13). 12 As Mark Sebba puts it, 
‘Orthography is par excellence a matter of language and culture’ (2007, 7). 13 

Following the lead of literacy studies (Scribner and Cole 1981, Street 1984, 
Barton 1994, Gee 1990), Sebba’s view of orthography is reminiscent of the ‘compa¬ 
rative standardology approach’ in that it sees orthography as a dynamic concept 
situated in social and cultural practices rather than as a fixed entity (Sebba 2007, 
13). Such a sociocultural approach allows us to recognise orthography as part of 
changing literacy practices, in contrast to the ‘autonomous models’, which treat 
orthography as ‘neutral technology that can be detached from specific social con¬ 
texts’, as defined by Street (1984,1) regarding the notion of literacy and applied to 
orthography by Sebba (2007,14). Discussing English orthography, Sebba makes a 
very important methodological statement: 


11 In various philological fields, orthographic variation is not usually studied as a process or 
practice. Rather, it is seen as a means of reconstructing ‘original’ texts or pronunciation/sound 
systems (den Heijer et al. 2014) or as a means of studying language change (Wagner et al. 2013b). 

12 The restrictive definition of orthography might be convenient to contrast institutionalised 
regulatory mechanisms with opposing tendencies of norm deviation and de-standardisation (in 
terms of struggling against the tyranny of writing; see Weth and Juffermans 2018 or Blomma- 
ert 2008, 7, who sets off orthography as normative, set against ‘hetero-graphy’); or it might be 
helpful to appraise the effects of Western models of language standardisation when designing 
orthography for previously unwritten and/or minority languages (see Lane et al. 2017). From 
a historical perspective, the narrow sense of the term might not be useful, given that the rise 
of orthographies as ‘absolute’ standards is a recent phenomenon. The fact that the notion of 
orthography does not yield easily to the restrictive meaning can be seen ironically in Rutlcowska 
and Rossler (2012, 214), who first define the term in the narrow sense (‘a spelling norm which 
consists of all the standardized and codified graphic representations of a language’), but then 
use it along the wide continuum from an unstandardised orthography characterised by variance 
to a standardised orthography without any variance. 

13 Lillis (2013, 24) widens the definition even further: ‘Orthography: 1. A writing system speci¬ 
fically intended for a particular language 2. A particular way of performing/producing a writing 
system of a particular language (for example, types of handwriting, fonts, spelling conventions 
used to represent verbal language)’. 
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This possibility of variation and deviation (licensed or unlicensed) from the conventional 
norms makes it reasonable to think of orthography as a social practice - a widespread and 
recurrent activity which involves members of a community in making meaningful choices, 
albeit from a constrained set of possibilities. (2017, 31) 

‘Meaningful choices, albeit from a constrained set of possibilities’ (or ‘repertoires’) 
were indeed recurrently made by the scribes of the manuscript cultures discussed 
in this volume. 

Orthography, as a set of conventions, may have patches of standard spelling 
within a system of internally organised sets conditioned by a multiplicity of lingu¬ 
istic and extra-linguistic factors. The variable application of conventions leading 
to combinations of orthographic tendencies which sometimes developed in a non- 
unidirectional way is demonstrated by many of the contributors to this volume. 


4 Written language: terminology 

Before expounding one crucial difference between orthographic standards and 
non-orthographic standards in manuscripts, it is worthwhile outlining the terms 
associated with orthography as they will frequently be evoked in this book. 
Coulmas (2003, 35-6) provides a useful set of terminology, in part summarised 
below and supplemented with other definitions - heuristically useful, if some¬ 
times conflicting. 

Writing system refers both to ‘the writing system of an individual language 
and to an abstract type of writing system’ (Coulmas); or ‘it is a means of repre¬ 
senting graphically a language or group of languages’ (Lillis 2013, 24). 

- Script stands for ‘the graphic form of the units of a writing system’ (Coulmas). 
Sebba 2007,11 and Lillis 2013,24 consider script a synonym of‘writing system’. 
Orthography (as mentioned earlier) is ‘the standardized variety of a given, 
language-specific writing system’ (Coulmas) or it is a set of conventions for 
writing words of the language (Sebba 2007,10). 

- Spelling is ‘the application of those [orthographic] conventions to write 
actual words’. Thus, ‘I am spelling the words of this sentence according to the 
orthography of English using the Roman writing system (or script)’ (Sebba 
2007,11), whereas in Coulmas’ opinion the term is ‘used interchangeably with 
orthography’. 

- Alphabet has several meanings, but it should be restricted to systems ‘where 
signs individually denote consonant and vowel phonemes’ (Daniels 1997, 
370). Sebba 2007 and Lillis 2013 use the attributive form ‘alphabetic’, refer¬ 
ring to a system based on consonants and vowels as individual units. A fine- 
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tuned definition of related terms (alphabet, abjad, abugida) is proposed in 
Daniels (1990). 

- Letter, in the most general sense, refers to ‘the basic functional units of all 
writing systems’. In a narrow sense ‘it refers to the basic symbols of Semitic- 
derived writing systems, including the Latin alphabet’ (Coulmas). 

Grapheme refers to ‘the abstract type of a letter and its position in a given 
writing system’ (Coulmas). Some linguists do not draw a sharp line between 
letter and grapheme, thus, ‘the smallest independent unit of the writing 
system — for example, a letter of the alphabet or a character in Chinese’ 
(Sebba 2007, 169). In this volume, irrespective of individual approaches to 
what constitutes a letter and grapheme, angled brackets are used for trans¬ 
literation of non-Roman-based letters/graphemes. For example, the Arabic 
letter v is represented as <b>. 

Graph is ‘a single visual sign or mark’ or ‘any written character or mark’ (Boltz 
1994,19,180). 

- Phoneme, on which the term ‘grapheme’ is modelled, is an abstract notion 
denoting the smallest distinct unit of sound. Phonemes are represented in 
slanted brackets, as in /b/. 

- Phone, or sound, is the acoustic realisation of a phoneme. Phonetic transcrip¬ 
tion is given in square brackets, so an aspirated pronunciation of the phoneme 
/b/ would be written as [b h ], for example. 

Some confusion may arise because of palaeographic usage of certain related 
terms, such as ‘graphic’/'graphical’ and ‘script’. Unlike ‘graphemic’, which refers 
to the abstract level of ‘grapheme’, ‘graphic’ and ‘graphical’ refer to the shape 
or visual depiction of a sign. Different realisations of graphic shapes result in 
different script types and script styles. From this viewpoint, palaeography is inte¬ 
rested in visual patterns of a given script rather than in abstract structures of 
writing systems. The abstract components of writings systems are conceptualised 
as types in linguistics and philosophy, whereas concrete instances of types, or 
their spatio-temporal particulars (e.g. ink composition), are tokens (Wetzel 2006). 
However, the word ‘script’ in the palaeographic sense of the term also has abs¬ 
tract and concrete components, which are especially discernible in manuscript 
cultures with standardised script types. This leads me to the next point - the dif¬ 
ference between orthographic and non orthographic standards in manuscripts. 
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5 Orthography and manuscripts: on the 
assessment of standards 

Compare these two statements: 

1. ‘Paper is produced according to standard sizes [...] For instance, a quarto 
from Royal paper (4°R) is 30 x 22, and a folio from Common paper (2°C) is 
31 x 22 (or somewhat smaller, owing to trimming in binding).’ (Gumbert 2010, 
emphasis added); 

2. The word ‘standardisation’ is spelled with an <s> in standard British English 
and with a <z> in standard American English. 

In (1) there is a certain range within the standard sizes (‘or somewhat smaller’). In 
case (2), it is an either/or principle. 

Standards in manuscript form, layout and script are measured and perceived 
differently compared to standards in orthography. As regards layout, variation in 
ruling patterns and varying sizes between the edge of the paper and the edge of 
the text area would not contradict a general standard of using intended principles 
(Andrist et al. 2013, 94), even if the proportions and dimensions were regulated by 
geometry (Deroche et al. 2006,169-71; George 2007). Rules governing standards of 
script style, that is, the shapes and proportion of graphic units, might also be strict 
and yet there is a certain amount of scope within which the inevitable variation in 
production (tokens) is permissible. 

In orthography, what is regulated by a standard is the abstract graphemic 
representation (type). The abstract match between phonemes and graphemes 
has to be absolute, whereas script style does not have to be. For example, if the 
phoneme /b/ is prescribed to be written as <b>, then it should not be written as <d> 
even if the shape of the letter may seem similar; but if the style of script is prescri¬ 
bed to be executed in a certain shape, e.g. at an angle, as in italics, some deviations 
from that particular shape (the angle in the case of italics) will still be counted as 
the prescribed style. So, in writing, matches of abstract types (between phonemes 
and graphemes) are regulated by orthography, and matches of types and tokens 
(graphemes and their shapes, or a type of layout and its realisation, or a type of 
manuscript form and its realisation) by other domains. 

This is not to say that script style does not have the abstract type as a concep¬ 
tual counterpart of the shape of a letter - it certainly does. But what is important is 
that in prescribed realisation of graphemes as representors of phonemes, the abso¬ 
lute match is the requirement, whereas in realisation of shapes, what is required is 
approximation to the abstract. 
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Another way of illustrating the type/token distinction in relation to ortho¬ 
graphy and script is as follows. The script’s coverage of phonemes is introduced 
through tokens: <b>, <$>, <B> and <b> all stand for /b/; the letter <b> can be con¬ 
ceptualised as a ‘vertical ascender bar with a single bottom loop/two loops facing 
right’. Changing a single compositional component is enough to write the letter 
incorrectly. Thus, if the component ‘facing right’ is altered, this results in <d> or <d> 
instead of <b>, and the violation of ‘vertical ascender’ gives <p> or <p>. 

In complex orthographic systems (like English), morphological, lexical or posi¬ 
tional parameters define the correct representation of a sound. Thus, fish cannot 
be represented as ghoti, as suggested by Bernard Shaw, because <gh> is only /f/ 
when it is used at the end of a word (e.g. in ‘enough’), <o> is just incidentally/i/ in 
the word ‘women’, and <ti> is only /// in a single orthographic unit: <-tion> (e.g. 
‘nation’). 14 The combination of rules may be complex, but a single misrepresen¬ 
tation at the level of types will be enough to invalidate the correct spelling. Thus, 
enough is not wrong because it is written in italics or as enough or enMcfi, but 
because the graphic unit <gh> is faulted by a single abstract segment, <b>. 15 

The set of rules governing non-orthographic features in manuscripts is not just 
different, but much more complex. Many are familiar with the notorious problem of 
describing a script type/calligraphic style verbally without showing the examples, 
even if the given script type has a precise set of abstract features (as in the geome¬ 
trical tradition of certain Arabic calligraphic hands). Even though the scribes who 
used such scripts employed geometry as an abstract model based on mathematics, 
a single deviation from one of the many parameters does not invalidate a script 
style, layout or paper form. 16 In contrast, a single deviation from one of the para¬ 
meters for a given grapheme will invalidate a letter. 

I took this digression about type/token relations to demonstrate that the 
dimension of orthography cannot be directly compared with the other dimensi¬ 
ons of manuscript production. This difference predicts that standardisation in the 
domain of orthography should not necessarily lead to standardisation in the other 
domains, such as form, layout and script (and vice versa). A similar unrelatedness 
has been mentioned earlier regarding the disparity between standardisation of lan¬ 
guage and that of orthography. 


14 See Stubbs 1980,51 for his discussion of the spelling <ghoti> proposed for fish by Bernard Shaw. 

15 The discussion of the ontological relationship between type and token is much more com¬ 
plex, questioning the existence of types, differentiating tokens from occurrences, etc. A more 
subtle model of orthographic validity could be described in terms of matches between types and 
occurrences (which are non-material instances of types) rather than between types. But since 
both types and occurrences are abstract entities, the general idea still seems relevant. 

16 This argument might not stand a chance in (post-)print societies with totalitarian standardisation. 
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6 Arabic script for non-Arabic languages 

Manuscript cultures based on Arabic script are especially interesting from the 
viewpoint of the theoretical considerations outlined above. Early codification 
and standardisation in many domains of manuscript production was the charac¬ 
teristic feature of Islam and Arabic as the language of the Qur’an. At the time 
when the Muslim Arabs started interacting widely with other cultures, introdu¬ 
cing them to Islam or encompassing them within the realms of various Islamic 
polities, standard orthography and regulated ways of producing manuscripts 
were already part of the Islamic cultural package. 

The standard orthography of Arabic script was codified during the first one- 
and-a-half centuries of Islam as the outcome of the standardisation of the Qur’an 
text. By the late 2 nd /8 th century, ‘the system of Arabic orthography was almost 
completed and [...] it has remained essentially the same ever since’ (Versteegh 
2001, 57). The process of standardisation of Classical Arabic was slower, but 
nevertheless it was completed by the 4 th /10 ,h century. The corpus of the language 
is believed to have been closed from that time onwards (Versteegh 2001, 64). 17 

Other domains of standardisation brought with Islam and visible in 
manuscripts are form, layout, script type and genre. However, unlike Arabic 
orthography, these changed with time and with cultures. Many remarkably unified 
types of format, layout and script tied to particular genres existed from the ear¬ 
liest centuries of Islam (George 2007) to the latest transitions from manuscript to 
print (Dobronravin 2017). Many of these types stemmed from the configuration of 
complex literacy practices born out of interaction between norms and standards 
of the contacting Islamic Arabic culture and non-Islamic non-Arabic cultures. The 
contributions in this volume - roughly organised in chronological order - deal 
with such contact phenomena, looking at various domains of the standardisation 
process. Eight chapters (2, 5-11) focus on writing traditions which adapted the 
Arabic script for non-Arabic languages, two chapters (3,12) are respectively con¬ 
cerned with Hebrew and fidal scripts used in the contexts of close contact with 
written and spoken Arabic, and one chapter (4) investigates possible influences 
of the Qur’an manuscript standards on Christian Arabic manuscripts {La Spisa). 
In some less well-studied cultures presented in this volume, the writing system 
based on Arabic script and orthography (in the general sense outlined earlier) 


17 It should be noted that these dates are only helpful as general guidelines for periodisation 
of the norms of Classical Arabic because, as den Heijer (2012, 10) puts it, ‘an overall history of 
Arabic orthography, which only partly overlaps with palaeography (a much better documented 
and studied issue!) is yet to be written’. 
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is the first entry point to the respective manuscript cultures {van der Putten 9, 
Bondarev and Dobronravin 10, Souag 11, Gori 12). In some better-studied cases, 
orthography and interaction between different writing systems and scripts are 
investigated with the new findings at hand {Orsatti 2, Wagner 3) and in the other 
traditions, some better studied, some little studied, standardisation of orthogra¬ 
phy is compared with that of script types, language, genre, layout and format {de 
Castilla 5, Schmidt 6, Sobieroj 8). The orthographic features of a single manuscript 
written in no less than seven languages, all in Arabic script, are discussed in 
chapter 7 {Ivusic). 

The study of the Arabic script as the medium for writing non-Arabic langu¬ 
ages and research into the interaction of Arabic script with non-Arabic langu¬ 
ages and scripts are by no means a novelty, Mohammed Naim’s (1971) survey 
being one of the earliest. However, previous research has touched upon these 
issues from the perspective of established disciplines and regional studies. Thus, 
Spooner and Hanaway (2012b) is a collection of papers on a wider topic of liter¬ 
acy in the Iranian cultural areas, with some articles addressing standardisation 
in various domains of language use and manuscript production, mostly in rela¬ 
tion to Persian, but also dealing with Ottoman Turkish (Darling 2012) and giving 
comparative insights into the normativity of Arabic, Persian and Latin (Morton 
2012). Some instances of Arabic script used for writing non-Arabic languages are 
discussed in Script Beyond Borders: A Survey of Allographs Traditions in the Euro- 
Mediterranean World (den Heijer et al. 2014). This collection of articles focuses 
on what the authors call ‘allography’: the phenomenon of writing a language in 
the script of another language. The comparative scope of Script Beyond Borders is 
vast, albeit restricted to the cultural areas prominently featured in the philologi¬ 
cal and historical disciplines. A collection of studies in Zack and Schippers (2012) 
looks into variation and development of standards in the context of interactions 
between religions, scripts and linguistic varieties of Arabic known as Middle 
Arabic and Mixed Arabic - the topic within the traditional scope of Middle East 
Studies. 

Adaptations of Arabic script for writing non-Arabic languages have also been 
treated in specialised research, such as Daniels 1997,2014, who provides a theore¬ 
tical background from the perspective of linguistics, Kaye 1996, 2006, who maps 
the Arabic script in various world languages, 18 Dobronravin 1999, which is the 


18 Kaye (2006) mentions Berber, the Dravidian language Moplah, a dialect of Malayalam (rela¬ 
ted to Tamil), the Indo-Arian languages Urdu, Sindhi and Kashmiri; the Iranian languages Ba- 
lochi, Pashto, Persian and Kurdish; the Austronesian languages Malagasy, Malay and Sulu; and 
Turkic and Caucasian languages. He also mentions special aspects of eleven languages: Persian, 
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first comprehensive study of the application of Arabic script to African and some 
other languages, and Mumin and Versteegh 2013, a collection of papers covering an 
impressive range of African languages written in Arabic script. 19 

What makes this volume different from previous literature is its attempt to 
study Arabic-based writing systems from the perspective of ‘comparative stan- 
dardology’, stepping outside the traditional disciplinary and regional boundaries 
and treating such systems in the context of manuscript production and (recons¬ 
tructed) social practices. A typological cross-cultural perspective is provided by 
a wide range of case studies - albeit limited - presenting twelve distinct writing 
traditions set up in contact situations, whereby different languages, cultures and 
scripts interact. These are as follows: Judaeo-Arabic in Hebrew script (8 th -12 th 
century, Wagner), South Palestinian Christian Arabic (8"’-9' h century, La Spisa), 
New Persian (9 th —11 th century, Orsatti), Aljamiado used by the Spanish Moriscos 
(15 th —17 th century, de Castilla), Ottoman Turkish in the Arabo-Persian script (14 th - 
19 th century, Schmidt), a single multilingual Ottoman manuscript (late 16 th century, 
Ivusic), Sino-Arabic writing in Northwest China (18 th -20 th century, Sobieroj), Malay 
Jawi script writing in the Moluccas (17 th —19 th century, van der Putten), Kanuri and 
Hausa Ajami writing (17 ,h -20 th century, Bondarev and Dobronravin), the Berber 
language Kabyle in Algeria (19 th -20 ,h century, Souag), and Ethiopian fidal script 
(19 th -20 th century, Gori). 20 


7 Factors of standardisation 

As mentioned, the paths of standardisation in the domains of language, orthogra¬ 
phy and manuscript production are not necessarily connected, and the standards 
are perceived and measured differently in each of the domains. With our limited 
knowledge of social practices and manuscript production in earlier cultures. 


Kurdish, Pashto, Kashmiri, Urdu, Sindhi, Ottoman Turkish, Uyghur, Malay (Jawi), Hausa and 
Swahili. 

19 Arabic script applied to non-Arabic languages is referred to as Ajami, from Arabic 'ajami 
‘non-Arab’ (and also ‘Persian’), derived from a collective noun, ‘ajam, meaning ‘barbarians, non- 
Arabs; Persians’. 

20 Due to terminological inconsistency across different disciplines, some of the terms deno¬ 
ting written cultures require preliminary clarification. Thus, Judaeo-Arabic means Arabic texts 
written in Hebrew script, Arabo-Persian is Persian in Arabic script, Christian Arabic stands for 
Christian texts written in the Arabic language and Arabic script, and Sino-Arabic indicates texts 
written in Arabic script but influenced by the Chinese writing practices. Further details are given 
in the respective chapters. 
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there is no simple way to demonstrate such relationships. This is directly and 
indirectly confirmed by the chapters in this volume, most of which come to more 
confident conclusions about standardisation processes in orthography rather 
than in other domains of manuscript production. 

Nonetheless, one possible way of seeing a larger interconnected picture 
would be to identify the factors behind the standardisation of writing. Various 
factors of this kind are presented in this volume. Thus, six chapters are the first 
studies of orthographic conventions in the given cultures ( Ivusic, Sobieroj, van 
der Putten, Bondarev and Dobronravin, Souag, Gori), whereas five others refine, 
critically analyse or summarise received understanding of the better-documented 
cultures ( Orsatti, Wagner, La Spisa, de Castilla, Schmidt). For the latter, it is easier 
to identify connections between norms in social structure, manuscript produc¬ 
tion and orthography. In the less-known cultures, such links are not as obvious. 
The factors outlined below are thus more of a selection of representative pheno¬ 
mena than a representative typological survey. But even the uneven comparison 
yields some interesting results. 

I have grouped the phenomena identified as relevant for the standardisation 
process into seven umbrella classes. These are factors related to (1) contact situa¬ 
tions, (2) authority, in the sense of top-down regulations, (3) networks of scribes, 

(4) identity - both communal and scribe-centred individual aspects of writing, 

(5) genre, as a cover term for socially identified literacy events, linguistic codes 
and registers, (6) language, in the sense of linguistic structures and features con¬ 
ditioning orthographic choices, and (7) medium (manuscript and print). This is a 
simplified classification: the conceptual scope of each of the seven keywords is 
wider and many phenomena cannot neatly be subsumed under a single group of 
factors, while some factors might better be grouped under a separate umbrella 
concept which I may have omitted. I shall try to cross-reference related groups of 
factors and point to possible overlaps as well. 21 


7.1 Contact 

As discussed in the section on comparative standardology, the phenomenon of 
standardisation can be understood as a special type of language contact (Haugen 


21 Unsurprisingly, this grouping has a bias towards sociolinguistics where such factors are ty¬ 
pically in the focus of study. I hope that invoking familiar sociolinguistic notions and relating 
them to the phenomena in manuscript cultures creates a potential for bridging the disciplines on 
more recognisable common ground and thus for achieving better interdisciplinary compatibility. 
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1972, 247) and this is equally relevant for the other domains of culture, too, such 
as religion, writing and manuscript production. 

What appears to be the most obvious factor behind standardisation in the 
Islamic cultures based on Arabic script is the standardised text of the Qur’an, 
which can be seen as the interface between Arabic and non-Arabic cultures in 
contact situations. As Morton (2012,150) puts it: 

The orthography of New Persian has been remarkably stable considering that the language 
has been used for well over a millennium [...] An important factor here is the influence of 
Arabic, the orthography of which has been even more stable than that of Persian and over a 
longer period. In the case of Arabic, stability was encouraged in particular by the attention 
paid to the interpretation of the text of the Qur’an and religious concerns in general. 22 

In many cases, religion barriers were irrelevant for the impact of Arabic ortho¬ 
graphy, as argued by Wagner: ‘the newly emerging Judaeo-Arabic standard was 
heavily Arabicised and written in an orthography where Classical Arabic spelling 
conventions were imposed on the Hebrew letters’. La Spisa equally shows that the 
influence of the orthography of the language of the Qur’an is discernible even in 
Christian Arabic manuscripts. 

However, standardised Arabic (including the Qur’an and Classical Arabic 
texts) was not the sole force exerting influence on standardisation in the contact 
culture. Thus, ‘[t]he models for correct Persian usage emerged in the 9th century 
from the pre-Islamic heritage of the epistolographic practices of the Sasanian 
Empire (AD 224-651)’ (Spooner and Hanaway 2012a, 17). The normalised ortho¬ 
graphy of New Persian written in Arabic script (Arabo-Persian) might have resul¬ 
ted from a convergence of pre-Islamic and Islamic standards of literacy practi¬ 
ces. In turn, as Orsatti demonstrates, once it was established as a stable norm 
by the end of the 9 th century, the orthography of New Persian influenced spelling 
in other scripts in Judaeo-Persian, Syro-Persian and Manichaean texts. A similar 
tendency for orthographic interaction across different scripts is suggested by 
Ivusic for some spelling conventions in Hungarian, Latin and German written in 
Arabic script in a 16 th -century Ottoman manuscript which may have been influ¬ 
enced by German Latin-script orthography. 23 


22 However, as Lameen Souag observes in his comment on this chapter, ‘while the text of the 
Qur’an is highly standardised, its orthography is much less so. The same word may be written 
in one aya with alif and in another without it; in one aya with l, and in another with o... And, 
of course, to the extent that it is standardised, its orthography often differs from what would 
become the much more standardised orthography of Classical Arabic’. 

23 The Ottoman chanceries were essentially embedded in a multilingual environment: ‘Turkish 
was not the only language used in the Ottoman chancery. Over the years the Ottomans emplo- 
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Contact-induced spelling conventions are also common in languages written 
in the same script. Such was the influence, albeit minor, of the non-Ottoman Turkic 
language Chagatai and Azeri Turkish on Ottoman Turkish orthography ( Schmidt ) 
and the influence of Kanuri spelling conventions on Hausa Ajami writing ( Bond¬ 
arev and Dobronravin). 

A special case of the effect that standardised Classical Arabic had on the ortho¬ 
graphy of the contact/target language is retention of the (historical) spelling of Arabic 
words. This is reported for most of the manuscript cultures discussed in this volume, 
even those lacking unified spelling conventions, and can be seen as a micro-area of 
standardisation, whereby the diffusion of a standard spelling is confined to a restric¬ 
ted set of lexical items or grammatical structures ( Bondarev and Dobronravin). 

Convergence induced by contact is also manifest in palaeographic features of 
manuscripts, irrespective of any religious divides, as La Spisa demonstrates with 
Christian Arabic sources. A striking case of interaction is the influence of Chinese 
calligraphy on the Arabic script used in Sino-Arabic manuscripts ( Sobieroj ). In 
manuscript cultures sharing the same religion, the unifying force of contact is even 
stronger (as in the Kanuri and Hausa manuscripts, both cultures being Islamic). 

A complex contact situation involving very different writings systems - ‘conso¬ 
nantal’ and ‘alfasyllabary’ - is discussed by Gori using the example of transliteration 
of Arabic texts in the Ethiopian script fidal, written in the Muslim communities of 
Harar. 24 Gori argues that the standard Arabic orthography of the Arabic text written 
in fidal does not prompt the spelling characteristics of Ethio-Arabic texts. Rather, it 
is oral recitation in Arabic that provides a reference point for spelling choices. 

An important aspect of contact is the tendency of contrast and divergence. 
With increased socio-cultural tensions, the orthography of one language initially 
modelled on the norms of another may de-standardise and take on new normative 
principles, as observed in the change of Judaeo-Arabic from the stage of orienta¬ 
tion based on Classical Arabic in the 11 th century to the period of the 13 th century 
influenced by Hebrew norms {Wagner). Similar divergence is often reported for 
manuscript cultures, with normative patterns in one religion triggering opposite 
patterns in the other, such as retention of the rollbook by the Jews ‘in order to 
differ from the Christians’ (Beit-Arie 1992,11). 


yed scribes who wrote in Latin, Greek, Italian, Uighur, Persian, Arabic, Serbian, Hungarian, and 
other languages [...]’ (Darling 2012,177). 

24 Strictly speaking, Arabic is not purely consonantal, nor is fidal alfasyllabic, but the systems 
differ in their treatment of vowels: Arabic makes short vowels optional, whereas it is obligatory 
to mark all the vowels in fidal. Daniels (1990, 1997, 2014) calls the former ‘not a perfect abjad’ 
(2014, 30) and the latter abugida. 
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7.2 Authority 

Authority is used here in the sense of top-down regulations. These include all 
sorts of centralised controlling mechanisms imposed by political, administra¬ 
tive, religious and other institutions. The contrasting side (in other words, the 
opposite value) of such top-down regulations is ‘bottom-up literacy regimes’ 
(Blommaert 2008, Juffermans et al. 2014, Weth and Juffermans 2018). The phe¬ 
nomena associated with authority are discussed in sociolinguistics literature as 
the most common and typical causes of language standardisation. However, top- 
down authority does not constitute an important factor in any of the twelve cul¬ 
tures presented in this volume. Spooner and Hanaway’s (2012a, 14) survey of the 
history of written Persian is instructive in this regard: 

What is most remarkable is the lack of any central authority to govern usage or establish 
models of correctness. [...] For Persian, [...] there was neither a primary text nor any other 
type of authority besides the heritage of Sasanian bureaucrats, which was gradually suc¬ 
ceeded by the evolving canon of secular Persian literary texts. In this connection it may be 
worth noting that Islamic civilization in general was characterized by a lesser degree of cen¬ 
tralization than other parts of the world, until perhaps the later emergence of what Hodgson 
(1974) calls the Gunpowder Empires: the Ottomans, Safavids, and Mughals. 

The emergence of written New Persian in the 9 th century discussed by Orsatti 
was followed by a centrifugal spread of its standardised variety. But it was the 
prestige of the literary language in the Persian courts of eastern Iran rather than 
any authoritative centralised force which was responsible for the spread of the 
written standard. 

The authority of influential scribes played a significant role in generating nor¬ 
mative spelling and unifying script styles, which were propagated through speci¬ 
fic networks {Wagner). However, such cases of authority fit into the categories of 
network and identity better, which are discussed below. 25 


25 It is interesting to mention the case of the Masoretic Syriac compilations of the 8 ,h -13 ,h cen¬ 
turies because they manifest a strict regulatory standard tradition developed at the time roughly 
coinciding with the time of the Judaeo-Arabic, Christian-Arabic and Arabo-Persian traditions 
discussed in this volume. ‘Based on sample texts they [the compilations] standardize the or¬ 
thographic representation of the pronunciation’ (Juckel 2011, 276). What makes the Syriac case 
special is that the scale of influence imposing an exclusive standardised orthography is greater 
than individual influence propagated via networks and therefore, in its prescriptive force, it is 
comparable to the better-known regulated standardisation processes, such as those in European 
languages. 
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A special orthographic twist indirectly related to top-down regulations is dis¬ 
cernible in Aljamiado manuscripts, which de Castilla discusses. There, centralised 
authoritative efforts of standardisation applied to written Spanish (Castilian and 
Aragonese) 26 in Latin script are visible in Aljamiado writings in Arabic script where 
an Aragonese variety of Spanish is spelled according to 16 th -century conventions. 

Centralised administrative power does not seem to have played a direct role in 
standardisation even in the Ottoman Empire, although the stability of the written 
language was a distinctive feature throughout the history of Ottoman manuscripts 
starting from the late 15 th century. Thus, ‘the stable language tradition in Turkish 
was [...] that of administrative and government documents’ (Darling 2012, 179) 
and ‘the tendency towards standardisation is visible in Ottoman texts’ (Schmidt). 
However, before 1908 there was no formal policy which regulated standards, 
either in manuscript production or in orthography. It is noteworthy that what was 
least formalised was orthography, as opposed to the more fixed uniformity of the 
layout, script style, formulaic expressions and lexicon (each entity conditioned 
by specific written genres). 

In sum, the positive value of top-down control almost seems irrelevant for 
standardisation processes in the cultures that are discussed in this volume. 
Looking at the category of authority from the opposite value - bottom-up liter¬ 
acy - might lead to insights, but these will be phenomena more adequate for the 
categories of networks, identity, genre and language. 


7.3 Network 

In a narrow sociolinguistic sense, a network is ‘a boundless web of ties that 
reaches out through a whole society, linking people to one another, however 
remotely’ (Milroy and Milroy 1992,5). First-order networks are ties directly ancho¬ 
red to individuals, and second-order networks are ties linked through others. I 
expand the notion of network to a wider connotation standing for a communal 
base of social practices in contrast to the individual space. For example, the space 
where reading of a text is shared by people of different ages and social status - 
as in the recitation of canonical texts - creates a specific literacy event network 
which may serve the participating individuals as a reference to normative codes. 
The concept of a network is tightly linked to what I call ‘identity’ in the next 
section, the notion related to individual-based phenomena. 


26 Wright 2006, 608. 
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In many pre-modern cultures, it is the role of scribal networks rather than 
central authority which drove (de)standardisation processes, as shown by 
Wagner and, inter alia, in Wagner et al. 2013a, b: *[,..] local networks, scribal 
schools and gatherings of writers developed their own norms, which they then 
followed’ {Wagner). The network factor, although not specifically addressed by 
La Spisa, was probably at work in the production of Christian Arabic manuscripts 
in the monastic communities of South Palestinian monasteries. 

The standards in written Persian ‘were maintained through the interaction 
within and between chancelleries and the court communities of multiple sulta¬ 
nates’ (Spooner and Hanaway 2012a, 17). By the same token, the chancery net¬ 
works were responsible for the formation and maintenance of standard forms in 
Ottoman Turkish (Darling 2012). 

The network factor also seems to be important in Aljamiado manuscripts of 
the 15 th —17 th centuries. Although they were written by unidentified scribes, de Cas¬ 
tilla argues ‘that most of the manuscripts were produced by skilled copyists’ to 
‘maintain the cohesion of their communities and to control them, at a moment 
when they were losing their rites and cultural practices’. 

Multi-nodal networks were behind the diversification of manuscript tradi¬ 
tions related to Malay in insular Southeast Asia of the 15 th to the 19 th century. Van 
der Putten explores (in)consistency in orthography and other domains of Malay 
manuscripts from the Moluccas, which was one of those nodes. 

Souag, describing orthographic characteristics in Kabyle, the largest Berber 
variety of Algeria, shows that the absence of stable spelling conventions is 
commensurate with the patchy and troubled history of Kabyle educational ins¬ 
titutions: ‘At no point has any one writer’s or school’s Arabic-script work been 
sufficiently widely read to be imitated, and all but the most prominent of one 
generation’s orthographic innovations have been forgotten by the next’. This 
negative finding only confirms the role of networks as one of the primary factors 
of standardisation processes. 

Shared space in literacy practices may have played a role similar to networks 
in stabilising spelling conventions in Old Kanembu manuscripts, as suggested in 
Bondarev and Dobronravin, and in Bondarev and Tijani 2014. 


7 A Identity 

Identity is a scalar concept, ranging from the wider sense of group identity to a 
narrower sense of personal identity, which stands for individual-centred pheno¬ 
mena. By explicitly narrowing down one side of the definition, I try to distingu¬ 
ish between communal-based group identities and individual-based identities. 
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however artificial such a distinction might seem. The heuristic reason for narro¬ 
wing down the notion of identity to a strictly individual level lies in the fact that 
the first-tier relation between the reader of a manuscript and the scribe is basi¬ 
cally a one-to-one relation. When I see a written word penned in a certain hand 
and style, what typically ‘stands’ on the other side of the word is a single hand of 
a single scribe. It goes without saying that any individual identity will most likely 
represent a larger picture of group identity upheld through networks. But it will 
be the individual scribal expression with all its idiosyncrasy which will provide 
the first entry point to whatever might be behind a unique instance of writing. 

From the standpoint of this categorial division, it is informative to explore 
the extent to which the impact of group identity on standardisation differs from 
that of individual identity. The first type of impact might be easier to identify. 
The group identity will not necessarily be directly present (or explicitly named) 
in the artefact. Rather, it will have to be postulated as a result of historical recon¬ 
struction. If the identity of one group opposing another is historically proven, 
however, as is the case for Muslim vs Christian identities in late medieval Iberia, 
for example (cf. Catlos 2014), then it is plausible to see differences in writing 
between these two groups as identity markers. And if there are consistent ways of 
writing associated with one particular group, it is reasonable to postulate identity 
as a factor for such consistency. 

Many chapters in this volume allude, in one way or another, to such reconst¬ 
ructed group identities. The emergence of the New Persian orthographic standard 
during the 9 th century is related to the milieu of the Persian courts which flou¬ 
rished in eastern and north-eastern Iran ( Orsatti ), with the subsequent spread 
of the written New Persian standard driven by the prestige of the literary lan¬ 
guage. The identity of the Jewish middle-class writers as a factor of (de)standar- 
disation process is reconstructable for the Judaeo-Arabic manuscripts from the 
Cairo Genizah, as discussed by Wagner. She also argues on the basis of her own 
and others’ sociolinguistic research that standardisation may be blocked by the 
factors related to regional identity or associated with particular scribal schools. 
Religious identity might be a factor for a micro-standard set of features within a 
larger system of conventions, as demonstrated by La Spisa. Thus, the orthography 
of the Christian-Arabic manuscripts of the Melkite monastic environment, which 
is largely congruent to Islamic manuscript conventions, has some distinctive fea¬ 
tures specific to Christian Melkite writing. The cultural identity of Mudejars and 
Moriscos in Castile and especially Aragon strongly influenced the production of 
Aljamiado manuscripts and associated orthographic conventions (de Castilla). 

Turning to the narrower side of identity, the most obvious instance of indivi¬ 
duals playing a role in the standardisation process is the prestige of those who set 
up the models subsequently copied by the others. The history of standardisation 
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testifies to the importance of individuals standing behind codification processes 
(Agha 2007, Sebba 2007). Individual prestige - one of the most important factors 
‘controlling the assertiveness of linguistic forms’ ( Wagner ) - is the long-standing 
subject of sociolinguistic investigation. Related to prestige, but not necessarily 
fully matched with it, are the instances of (identifiable) individual choices of 
norm or variation and generally, all sorts of idiosyncrasy. Den Heijer (2012,11), 
discussing standardisation in what is known as Middle and Mixed Arabic, sug¬ 
gests that the deviations from normative systems ‘reflect a conscious desire to mix 
registers and styles’. This point is elaborated by Wagner, who shows that indivi¬ 
duals may consolidate microsystems shared by a limited group of manuscripts 
and persons. Similar small-scale individual substandards are observed by van der 
Putten, Bondarev and Dobronravin and Souag. 27 

When doing my research on Old Kanembu manuscripts in Nigeria, I often 
asked the Islamic scholars proficient in the variety of Old Kanembu called Tarjumo 
why people wrote the way they did. The typical answer was: ‘people write however 
they want to, the way they hear it’. Yet on closer inspection, various idiosyncra¬ 
tic choices betray something more systematic than mere idiosyncrasy ( Bondarev 
and Dobronravin). Even though the full scale of personal social relationships, as 
it might be manifest on the pages of manuscripts, will most likely remain out of 
reach due to the inevitable lack of contextual information, the factor of individual 
identity - however difficult it is to tackle - is central to the study of standardisa¬ 
tion. This is because personal identity intersects with all the factors discussed 
here (as well as others that I may have omitted). Although the factors are certainly 
interrelated, the act of implementing a literacy practice - informed by a multipli¬ 
city of factors and scribal repertoires - is ultimately carried out by a single scribe 
in a single hand. 


7.5 Genre 

I use genre as a cover term for linguistic codes and registers embedded in a variety 
of socially identified and conventionalised literacy practices, framing a ‘particular 
perspective on the world’ (Barber 2007, 41). This is wide enough to include issues 
of education, types of texts and manuscripts, scribal repertoires, transmission and 
translation of canonical texts, culturally conditioned preservation of earlier varie¬ 
ties of language, distribution of registers and genres across literacy practices and 


27 On the issue of deviation from the norms conditioned by distinctive group and individual 
identity, see Weth and Juffermans 2018. 
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many other related phenomena. By encompassing all this under one category, I 
argue for the intrinsic interrelatedness of such phenomena, which constitute a 
distinct family of factors. 28 Written language (as well as spoken) is never neutral. 
Each speech and literacy event requires a selection from the available linguistic 
repertoires of speakers and scribes. Some events are flexible and open to a greater 
variation of choices, whereas others are more conservative with a limited range of 
choices, for example in recitation and written transmission of canonical texts. One 
literacy output will result in elimination of variation and the other in the prolife¬ 
ration of variation. It is predictable that the standardisation process and its speed 
will be different in different registers and types of texts, and it will often be a single 
register which will activate the process of standardisation. As Orsatti argues. New 
Persian orthography came into being as part of the development of literary lyric 
poetry in the Persian courts. With the spread of the fixed orthography to other 
types of texts as well, the difference in genre did not disturb its stability. The sub¬ 
sequent irrelevance of genre for spelling conventions which became fixed is taken 
as proof of the antiquity of Arabo-Persian standardised orthography. 

Genre variation is also a factor behind the divergence of standardisation. The 
genre-specific distribution of (sub)standard microsystems has been reported for 
many manuscript cultures, including those discussed in this volume. Wagner 
shows that from the 13 th century onwards, the norms in the orthography of Judaeo- 
Arabic depended on the type of texts involved, unlike the previous period of the 
10 th -12 th centuries when standardised orthography cut across the genres. Equally, 
van der Putten points to a higher degree of consistency in doctrinal Islamic texts 
as compared to secular texts in the contexts of Malay manuscripts. Genre-specific 
orthography is elaborated by Sobieroj using a number of examples in xiaojing 
writing. The differences in Kabyle orthographic subsystems presented by Souag 
are to a certain extent motivated by different types of texts and literacy contexts, 
such as the needs of students attending a zaouia (religious school). 

A specialised standardised register may come into being as a result of trans¬ 
lating holy scriptures. Thus, Saadiah Gaon’s Bible translation facilitated the con¬ 
solidation of the consistent Arabicised orthography of standard Classical Judaeo- 
Arabic, which also spread to utilitarian writing in Fatimid Egypt {Wagner). 

In the context of Islam in West Africa, the translation of scriptures and cano¬ 
nical texts was a prominent factor in creating special literary registers in West 
African societies, leading to stable spelling conventions (Bondarev 2014, Bond¬ 
arev and Dobronravin, Ogorodnikova 2017, Tamari and Bondarev 2013). One side- 


28 For similar or related approaches to genre, see, inter alia, Bakhtin 1981, Barton 1994, Barber 
2007, Biber and Conrad 2009, and Lillis 2013. 
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effect of translational practices is the preservation of earlier linguistic features 
of language, for example, archaic Aragonese in Aljamiado manuscripts (de Cas¬ 
tilla), archaic Kanuri in Old Kanembu manuscripts (Bondarev 2013, Bondarev and 
Dobronravin ) or archaic Hausa in Hausa manuscripts (Dobronravin 2013). 

Different written genres could influence the distribution of linguistic norms 
and manuscript features, such as layout and script style. Schmidt discusses diffe¬ 
rent script styles applied to different types of texts in Ottoman manuscripts. What 
makes the case of financial documents written in siyakat script especially inter¬ 
esting is that the language of the texts included fixed sets of formulaic expres¬ 
sions and vocabulary and was meant to be clear and unambiguous, while the 
script ‘was intended to make the documents hard to falsify and hard for outsiders 
to understand or imitate’ (Darling 2012,180). 


7.6 Language 

I restrict this group of factors to a selected number of identifiable cases of lingu¬ 
istic structures motivating orthographic choices. These factors are followed by a 
separate sub-entry of ‘orthographic design and its uses’ as the topic typically dis¬ 
cussed in linguistics. Many factors mentioned in this section deal with linguistic 
features expressed at individual levels, therefore this group relates to the wider 
category of ‘genre’ in a similar way to ‘identity’ relating to ‘network’. 


7.6.1 Linguistic structures: phonology, morphology, syntax 

Many manuscript cultures have phonetic orthography at one stage of their deve¬ 
lopment or the other ( Wagner , Ivusic, Sobieroj, van der Putten, Souag, Gori). 
Phonetic transcription opens avenues for variation due to variation in spoken 
language conditioned by social status, register, dialect and other factors. An 
important feature of phonetic transcription is that it is largely based on scribes’ 
linguistic intuition. Spelling choices informed by speakers/writers’ intuition may 
form stable patterns at micro-levels of orthography with the resultant retention of 
such microsystems. What may first appear to be spelling inconsistency on closer 
examination turns out to be a stable orthographic tendency applied at micro¬ 
levels, which reflect fair guesses by the scribes about the structure of the language 
they were using in writing. The patterns of emergent spelling in manuscripts 
are in many respects comparable to the patterns discovered in the studies of 
children’s invented spelling (see an overview in Read and Treiman 2013). In spite 
of great variation from child to child, some patterns of spelling are systematic 
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and uniform in children’s writing. For example, one of the typical features is 
omission of nasals before consonants. This same systematic omission has been 
reported for many Arabic-based systems ( Bondarev and Dobronravin, Mumin and 
Versteegh 2013, Souag 2010). The systematic patterns of scribes’ linguistic intui¬ 
tion may be detected in various domains of the language: phonemic (e.g. Souag), 
phonological (e.g. Orsatti, Ivusic), morphological (e.g. Ivusic ) or syntactic (e.g. 
Dobronravin 2006). Taking such patterns into account may help us see distinct 
microsystems or ‘collections of unstandardised consistencies’ (van der Putten ) in 
manuscripts with considerable orthographic variation. In Bondarev and Dobron¬ 
ravin, we describe such microsystems as ‘stable sets of grapheme-phoneme cor¬ 
respondences’, highlighting the fact that orthographies may be composed of a 
variety of stable subsystems, each with its specific multiplex correspondences. 29 
For example, set 1 has a certain number of graphemes to cover a certain number 
of phonemes, whereas set 2 will have different graphemes to cover different pho¬ 
nemes, and set 3 will differ from sets 1 and 2, etc. Thus, the orthographic system 
which may be highly variable within one set might be remarkably stable in terms 
of contrast between the sets. 

The conditioning factors behind such microsystems of standardisation 
include salience and frequency. For example, phonetically salient items such as 
stressed vowels or syllables at the end of a phrase (Bondarev and Dobronravin) 
have high potential for being encoded uniformly. The frequency of items may 
be conditioned by purely language-specific peculiarities such as the frequency 
of function words like prepositions. But frequency can also be register-specific, 
some registers having more of one type of item and less of another (see Biber 
1995, for instance). 

As mentioned under ‘contact’, words borrowed from Arabic typically contri¬ 
bute to a standardised orthographic subset as they are usually written in the ori¬ 
ginal standard Arabic spelling, irrespective of their actual pronunciation in the 
recipient language. Specialised vocabulary is another factor of stability, as was 
the case with botanic items in Aljamiado (de Castilla). 

In this random overview of linguistic factors playing a role in standardisation 
processes, one special case worth mentioning is linguistic economy. In Ottoman 
Turkish, it was the ‘omission of whatever can be omitted, such as repetitive verb 
endings’ (Darling 2012,180), that counteracted standardisation. 


29 This approach is comparable to the treatment of graphemic and phonemic variation in Midd¬ 
le English manuscripts. Scholars use literal substitution sets for graphemes and potestatic sub¬ 
stitution sets for phonemes (Rutkowska and Rossler 2012, 222; LAEME - Linguistic Atlas of Early 
Middle English). 
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7.6.2 Orthographic design and its uses 

The phenomena discussed here are more related to the orthographic system as a 
whole rather than to individual choices accounted for by scribes’ phonological 
(and, generally, linguistic) intuition. 

Phonological differences between Arabic and non-Arabic languages result 
in orthographies with underspecification (one letter standing for more than one 
sound) and overspecification (more than one letter for one sound), which could 
either have (de)standardisation effects or have no effect whatsoever. A common 
way of overcoming divergences between Arabic and other languages’ phonolo¬ 
gies was the invention of new letters. The introduction of new letters explicitly 
leads to a break from the standard model of Arabic orthography, as was the case 
in Persian, Turkish, Malay, Kanuri, Hausa, Berber and other languages. In some 
manuscript cultures such as Persian ( Orsatti ), the new letters entered pre-exis¬ 
ting stable orthographic systems, while in others the invented letters initially 
increased the amount of variation in spelling, with subsequent stabilisation of 
orthography, as the history of Hausa writing shows ( Bondarev and Dobronravin). 

Historical orthography, often originating from certain literary registers, has a 
tendency to remain in manuscripts over a long period as standardised microsys¬ 
tems ( Orsatti , de Castilla, van derPutten, Bondarev and Dobronravin). 

Some orthographic designs derive from the Arabic model, but take on a dif¬ 
ferent function, as, for example, the otiose use of the Arabic letter <h> to indicate 
the final -a of lexical items ( Orsatti on Persian and Bondarev and Dobronravin on 
Kanuri). 

Idiosyncratic unsystematic spelling might be seen as chaotic and outside any 
considerations of norms. However, many studies, including this volume ( Wagner ) 
and elsewhere (Vandenbussche 2002; den Heijer 2012, 11), show that variability 
of spelling - even of the same word on the same page or line - was the result of a 
conscious desire to mix linguistic styles, and therefore such inconsistencies are 
better described as ‘distinct spelling systems’ (Vandenbussche 2002,32). Thus, in 
many manuscript cultures, a substantial variation in spelling may paradoxically 
be indicative of a standard norm. 


7.7 Medium 

7.7.1 Manuscript 

Ideally, this set of factors should address the manuscript-related causes of the 
process of standardisation as it may be manifested in manuscript culture. 
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However, the great breadth of ‘manuscript-related’ issues makes it open to taxo- 
nomically different factors. One group of factors conditions the standardisation 
of manuscript production - technology and husbandry will condition paper pro¬ 
duction and parchment production respectively, for example. The other group of 
factors is generated by manuscript production itself and has both self-inflicting 
influence (for example, the production of writing material such as parchment will 
dictate the size of manuscripts) and external influence (upon orthography, for 
example). Since standardisation processes in the manuscript cultures presented 
in this volume are largely explored in the domain of orthography, the complex 
issues of the relationship between codicological and linguistic features have only 
been touched on lightly. In the following paragraph, I shall only mention some 
manuscript-related factors as they were presented in some of the chapters. 

La Spisa points to the relationship between the norms of script type and 
layout: the change from the early Abbasid script to naskhi script led to a change in 
the whole structure of page layouts. Sobieroj observes that in some liturgical texts 
‘standardisation in handwriting and choice of format has been realised in mutual 
dependency’. He also discusses the influence of Arabic manuscript culture on the 
processes of standardisation in Khatt-i sini manuscripts (this factor overlapping 
with those discussed under ‘contact’). Schmidt demonstrates that standardisa¬ 
tion of script types correlated with standards regarding formats and layouts, and 
was prominent in Ottoman manuscripts. He also points to the ascendancy of cal¬ 
ligraphy over orthography: ‘Calligraphy was considered to be one of the highest 
forms of art, if not an esoteric science, compared to which mere orthography was 
insignificant’. 

An important factor underlying the uniformity of scripts and uniform or 
varied orthographic conventions was copying, which is not discussed in this 
volume, but was vividly described by Darling 2012, again using Ottoman Turkish 
manuscripts as examples. In the end, copying variation in manuscripts reduces 
the potential to invent one’s own spelling. 


7.7.2 Print 

Most of the cultures discussed in this volume existed before the print era or were 
outside its immediate sphere of influence. Some of the chapters here mention 
the interaction between manuscript and print cultures when it is observable 
( Schmidt , Sobieroj, Bondarev and Dobronravin ) and one chapter’s conclusions are 
mostly based on printed material ( Souag ). Other than that, the topic of print was 
considered beyond the book’s scope. Firstly, standardisation related to print cul¬ 
tures, especially in the context of standardisation of European languages, has 
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been studied much more than standardisation in manuscript cultures. Secondly, 
the transition from manuscript to print and its implications for standardisa¬ 
tion would require separate research. ‘Manuscript vs print’ is not a dichotomy. 
Contrary to the second quote in this chapter (Barton 1994,124), there is no clear 
divide between ‘the earlier scribal culture’ with ‘idiosyncratic formats’ produced 
by ‘individual copyists’ on the one hand and printing on the other hand, which 
‘allowed a large number of identical texts’ (for more subtle views on the problem, 
see the literature cited in the section on ‘comparative standardology’). Upon the 
advent of print, manuscripts were the models for book production, and with its 
advancement, printed material influenced manuscripts, the latter having persis¬ 
ted up to the present day in many corners of the world. 


8 Conclusions 

Continuing the dialogue between ‘comparative standardology’ and variation 
studies, this volume offers comparative data which allows us to delineate large 
groups of factors behind the standardisation process in manuscript cultures. 
The grouping into factors suggested here is by no means definitive and requires 
further elaboration. That said, I hope that the groups of factors discussed here 
will be helpful as a preliminary checklist for standardisation in interdisciplinary 
research. 

The discussion in this chapter draws on the assumption that standardisation 
can be understood better if it is considered a non-unidirectional process. The 
groups of factors can thus be seen as channels or pathways of ^(standardisa¬ 
tion. A comparative analysis of such pathways in the twelve manuscript cultures 
addressed in this volume permits us to postulate some generalisations, as follows. 

Contact situations do not necessarily lead to the exchange of standardised 
orthographic principles. In many cultures, the co-existence of Standard Arabic 
and non-standardised languages spoken and written in Muslim communities 
poses a paradox: such languages are profoundly influenced by Arabic, but their 
orthographies are not modelled on the principle of standardisation. 

This apparent paradox is resolved by the prediction that standards in ortho¬ 
graphy - one of the domains of manuscript culture - are conceptually different 
from standards in other domains, such as format, layout and script. As there is 
a difference between the standardisation of spoken language and written lan¬ 
guage, there is also a difference between the standardisation of orthography and 
other domains of manuscript production. 
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Each domain of manuscript culture develops microsystems of standardisa¬ 
tion. This observation seems especially pertinent to the understanding of varia¬ 
tion and standardisation in orthography. Orthographic variation is rarely arbit¬ 
rary. Rather, it is organised in distinctive grapheme-phoneme subsets. Thus, the 
orthographic system which may be highly variable within one set might be remar¬ 
kably stable in terms of contrast between the sets. 

Different domains have different ‘areas’ of uniformity and standardisation 
in a given manuscript culture. Layout and script type are one such area, which 
is often subdivided into two: layout, having a wider regional and cultural scope 
of uniformity, and script type, having a narrower scope of uniformity. The other 
area of uniformity, which is not necessarily linked to the layout/script domain, is 
orthography with its own principles of (non-/de-)standardisation. 

Thus, a general tendency observable at the level of physical features of 
manuscript production is that layout and script types tend to be unified, irrespec¬ 
tive of orthographic norms and, vice versa, orthographic norms develop irrespec¬ 
tive of norms applied to physical domains of manuscript production. 
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Persian Language in Arabic Script: 

The Formation of the Orthographic Standard 
and the Different Graphic Traditions of Iran 
in the First Centuries of the Islamic Era 

Abstract: This paper offers a critical review of the orthography of the most ancient 
original New Persian texts written in the Arabic (11 th century), as well as in other 
scripts (Hebrew, Syriac and Manichaean), to discover any indirect evidence about 
the beginnings and the formative period of Arabo-Persian orthography. On the 
basis of the New Persian original documents in non-Arabic scripts here examined, 
dating to an earlier period than documents in the Arabic script, we can tentatively 
date and localise the beginning of the Arabo-Persian orthographic influence on 
the other written traditions of Iran: northeastern Iran, end of the 9 th - beginning of 
the 10 th centuries. By then, Arabo-Persian orthography appears as already fixed. 
Though it cannot be excluded that some scattered and unsystematic attempts at 
adaptation of the Arabic script to Persian were accomplished here and there in dif¬ 
ferent places of Iran, the hypothesis of a multi-centric origin of the Arabo-Persian 
orthographic canon seems less probable in the light of our documentation. 


1 Introduction 

The aim of this study is to carry out a critical review of the orthography of the most ancient 
original New Persian texts written in the Arabic, as well as in other scripts (Hebrew, 
Syriac and Manichaean), to discover any indirect evidence about the beginnings and 
the formative period of Arabo-Persian orthography. 1 By New Persian, or simply Persian, 
I am referring, in a broad sense, to the Persian language of the Islamic period. 


Though agreeing with Giorgio Banti’s criticism expressed during the Hamburg conference in 
2013 about the use of the term ‘standard’ for pre-modern times, I retain it to underline the fact 
that New Persian orthography in Arabic script seems to follow a well-established norm from the 
time of its most ancient original attestations. 


1 A general study of the most ancient New Persian documents in alphabets different from Ar¬ 
abic is given by Orsatti (2007,107-172, with reference to previous studies), and - from a histori- 
co-cultural perspective - by Orsatti (2008). For the layout of the page in Islamic manuscripts 


3 Open Access. © 2019 Paola Orsatti, published by De Gruyter. [©1311333] This work is licensed under 
the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110639063-002 
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2 The adaption of the Arabic script to New Persian 

As is well-known, towards the mid-7 th century western Iran, the heart of the 
Sasanid empire, was conquered by the Arabs. This historical event had many 
important consequences: most Iranians, more or less gradually, gave up their 
ancient Zoroastrian religion and converted to Islam; and - given the strong link 
existing between writing and religion - they abandoned the ancient Pahlavi 
script, in which what we call Middle Persian was written, and adopted the Arabic 
alphabet to write their language. 

The first original written records of Persian in Arabic script (I will speak of 
‘Arabo-Persian’), 2 both literary and not, go back to the 11 th century: therefore 
they are not really so ancient. We do not know when, where and for what pur¬ 
poses (administration, literature, private documents etc.) the Arabic script was 
first adapted to write the Persian language. And we do not know in which way 
and through what stages Arabo-Persian orthography was fixed. From the extant 
documentation we get the impression that an orthographical norm was well- 
established from the very beginning. On the other hand, at least for literary texts 
which have undergone a long manuscript transmission, we do not know to what 
extent the normalising intervention of copyists may have contributed to suggest 
the existence of an early orthographical norm. 

Historical sources can help find out when and for what purposes New Persian 
emerged as a written language, prevailing over Middle Persian, written in the 
Pahlavi script, and over Arabic, the language of religion and science. In the first 
centuries of Islam the prestige of Middle Persian was still strong, especially in 
south-western Iran. From a piece of information provided by the Arab historian 
Baladuri (9 th century) we know that, in western Iran, Middle Persian in Pahlavi 
script was used for administration until the end of the 7 th or the beginning of the 
8 th century, and in eastern Iran even longer, before being substituted by Arabic. 3 
For administration, therefore, Middle Persian was replaced directly by Arabic, 
and only later was New Persian used as the language of administration. The same 
can be said of epigraphy (Bivar 1986): for inscriptions, Middle Persian in Pahlavi 


and its relationship to the text copied, manuscript typology and destination, I again refer to 
some observations in Orsatti (1993, 281-282, 319-323). On the historical orthography of New Per¬ 
sian texts in Arabic scripts, apart from scattered remarks on the orthography of manuscripts in 
the introductions to the editions of texts, see Sirvani 1974 and Hashabeiky 2005, 79-85 (remarks 
based on an early study by falal Matini). 

2 The term ‘Muslim Persian’ is less appropriate because different minorities in Islamic Iran, 
especially Christians, also commonly used the Arabic alphabet (see below). 

3 See Xanlari 1986,1, 307-314. 
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script was gradually replaced by Arabic, written in the so-called Kufic script, one 
of the most ancient inscriptions in New Persian being the Kufic inscription in 
Persian verses in the palace of Mas'ud III at Ghazna (Afghanistan), dated 505/1111 
(published by Bombaci 1966). 

What is sure is that, when in the 9 th century, during the period often referred 
to as the ‘Iranian renaissance’. New Persian lyric poetry of the literary type - i.e. 
patterned after Arabic poetry - came into being in the Persian courts of eastern 
Iran, it could not be written down other than in the Arabic script. Indeed, New 
Persian lyric poetry arose as an experiment in the Persian courts, by then inde¬ 
pendent or semi-independent from the caliphate of Baghdad; an experiment that 
consisted of substituting Persian for Arabic within the pattern of Arabic poetry. 4 
We can suppose that the establishment of New Persian orthography was a part of 
this experiment. 

The anonymous author of a local chronicle, the Tarikh-i Sistan ‘History of 
Sistan’ (11 th century, with later additions), recounts an interesting story about what 
he presents as the first Persian poem of the Islamic era, which - he says - was 
told for Ya'qub b. Layt of the Saffarid dynasty of Sistan (south-eastern Iran). In 
251/865 Ya'qub gained an important victory over one of his enemies; on this occa¬ 
sion poets recited poems in Arabic in front of him, extolling his victory. Ya'qub, 
who was a man of arms and not a learned person, said then the famous phrase: 
‘Why should a poem be composed which I cannot understand?’ Muhammad b. 
Wasif, his correspondence secretary, was in the audience; he had the idea of com¬ 
posing a poem in Persian, of which six lines, full however of Arabic words and 
expressions, are quoted ( Tarikh-e Sistan: 166-167). 

The story of Ya'qub the Saffarid and his secretary and court poet Muhammad 
b. Wasif is an important piece of information about when, where and in which 
way the new Arabicised poetry in the ‘vulgar tongue’ of the time, that is New 
Persian, came into being: during the 9 th century, in the Persian courts which flou¬ 
rished in eastern and north-eastern Iran. 5 Prose began later: the first prose texts 
in literary New Persian, in Arabic script, go back to the mid-10 th century - the 
most ancient dated New Persian manuscripts of literary works in Arabic script, 
however, belong to the middle of the following century. 


4 On these themes cf. in particular Bausani 1960, 307-311 and the studies by Lazard 1971 and 
Lazard 1975. 

5 For an evaluation of this and similar accounts concerning the birth of New Persian literary 
poetry, cf. in particular Lazard 1975, 607-610. 
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2.1 Persian in Arabic script: general characters 

The following table shows the most important characteristics of the adaptation of 
the Arabic alphabet to the Persian language (mid-ll ,h - mid-13 th century): 


A. Conservation of the ‘only Arabic’ letters, i.e. letters found only in Arabic loanwords: 


Arabic letters 

Arabic value 

Phonetic value in Persian 

dj <t> 

0 

s 

Arabic i <d> 

8 

z (perhaps pronounced 8 in Early 
New Persian) 

C <h> 

voiceless pharyngeal fricative h 

h 

<s> 

pharyngealised s 

s 

u- 3 <d> 

pharyngealised 8 or d 

z 

Ja <t> 

pharyngealised t 

t 

ii <Z> 

pharyngealised z or 8 

z 

<3 <q> 

q 

introduction of the new phoneme 
/q/, in later times merged into /y/ 
<g>, but retained till now in 

Afghani Persian 

t <‘> 

voiced pharyngeal fricative 

not pronounced or pronounced as 
a glottal stop (merging with the 
Arabic hamza [?]) 


B. Introduction of new letters, mainly by addition of diacritical points 


Letters 

Phonetic value 

Commentary 

<—J 

P 

already introduced, as the three other 
‘Persian’ letters (see below), in the most 
ancient manuscripts, especially of fine 
workmanship, but normally seldom used 

5 

c 

see above 

3 

z 

see above 

i _£ 

g 

see above 

—ii <b> 

p 

spirant allophone of /b/. This letter is used 
only in the most ancient manuscripts 

‘Persian’ J <d> 

8 

spirant allophone of post-vocalic /d/, used 
in manuscripts until the middle of the 13 th 
century 
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C. One digraph, which represents the phoneme /xw/ of Early New Persian (this beingthe only 
case of historic orthography in the New Persian writing system, conserved till now): 


ji- xw later merged with/x/with dropping 

of the labial articulation of the ancient 
phoneme 


One of the main differences between the Arabo-Persian writing system and the 
Arabic one is the regular spelling of final short -a, which in Arabic is not written, 
by means of <h>. This is well explained by a passage from the Mu'jam by Sams-i 
Qays (first half of the 13 th century), where the great Persian philologist says that: 
‘In Persian orthography ( darxatt-ipars!), whenever a word ends in -a, you have to 
add a ha to it [...] These has in the Arabic language are clearly pronounced [i.e. in 
their value as h] [...] Instead in Persian {parsi ) they are by no means pronounced 
except, by poetic license, if they are in rhyme position; (only) in this case they 
are counted as a quiescent letter and are feebly articulated’ (Sams-i Qays 1981, 
243-244). 

2.2 Written and not-written morphemes 

An important characteristic feature of Arabo-Persian orthography is the way two 
important and very frequent morphemes of the Persian language are represented: 
the idafa, that is the particle -i of the status constructus, and the coordinative con¬ 
junction u (English ‘and’). Both consisted in a (probably) short vowel, and there¬ 
fore - according to the Arabic writing system - should not have been written; but, 
while the first one - in contrast with the pre-Islamic graphic traditions of Iran - is 
not represented in the Arabo-Persian orthography, the second one is written <w> 
as a stand-alone word. 

The idafa particle is an enclitic -i vowel, originally long but probably shor¬ 
tened from the beginning of the New Persian period, which follows the head of a 
nominal phrase; kitab-i mu'allim ‘the book of the teacher’, kitab-i buzurg ‘the big 
book’. In Arabo-Persian orthography this short -i is not written, except after words 
ending in a vowel: only in this case is the idafa represented as <y> (the letter yd) 
attached to the preceding word - what is represented in such cases is the glide y 
which develops between the final vowel of the word and the idafa vowel: banda- 
yi tu ‘your slave’. In the most ancient Persian manuscripts this orthographic rule 
is always observed, apart from very rare cases in which the idafa is represented by 
<y> also after a word ending in a consonant (Lazard 1963, 200, § 162). A couple of 
such cases, pointed out by Minorski (1942,188), seem to be present also in one of 
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the most ancient non-literary Arabo-Persian documents, the deed for the sale of 
land found in Khotan (Central Asia) (line 2), dated 501/1107. Though it is generally 
admitted that in New Persian the vowel representing the idafa had already been 
shortened, these occasional spellings, as well as the fact that in poetry the idafa 
can count as a short or as a long vowel, seem to point to the presence - in Early 
New Persian - of a long variant of the idafa (Meier 1981,131-132). 6 

An ancient spelling of the idafa particle by means of an isolated or proclitic 
alif, attested in the quotation of Persian phrases in works of some Iranian authors 
(Abu Nuwas, Hamza al-Isbahanl, the author of the Ta’rix-i Qumm) of the 9 th -10 th 
centuries writing in Arabic, may represent an ancient spelling, soon fallen into 
disuse, preceding the establishment of Arabo-Persian orthography (Henning 
1958, 88-89); this is even more relevant in the light of the fact that the authors in 
which this spelling is to be found are from central and western Iran. 

A group of fifteen Judaeo-Persian tombstones edited by Gnoli (1964) found in 
the Ghur (Afghanistan) and dated - according to the Seleucid era - to the years 
between the second half of the 12 th till the beginning of the 13 th century CE, show 
the gradual falling into disuse of the writing of the idafa, perhaps due to the 
influx of coeval Arabo-Persian orthography, and possibly also as a consequence 
of the disappearance of the long idafa. In the initial formula wafat-i ‘adan ‘Edenic 
death’, the idafa is written as <-y> joined to the preceding word in tombstones IV 
and VIII (dated 1484 Seleucid/1172 and 1502/1190 respectively), and is not written 
in tombstones III (also dated 1484 /1172), V, VI, VII, IX, X, XI (this being the most 
recent one, dated 1526/1214), and XIV. 

Besides the idafa particle - which, being a (short) vowel, very soon disap¬ 
peared from orthography though remaining well alive in the language - other 
morphemes are not represented in the Arabo-Persian orthography. One of them, 
consisting in an originally long o afterwards shortened, is the directional Middle 
Persian preposition o ‘towards, to’, well-known for the New Persian linguis¬ 
tic period from the most ancient Judaeo-Persian documents originating from 
south-western Iran (Lazard 2009; see below. Group C). Its disappearance from 
Arabo-Persian orthography was perhaps due to the fact that this morpheme was 
considered too dialectal, colloquial or archaic to be allowed into the writing; or, 
rather, its disappearance may well correspond to its gradual falling into disuse 
from the north-eastern variety of Persian which was at the basis of literary New 


6 See also an inverse spelling in a New Persian fragment in Manichaean script, pointed out by 
Provasi (2011, 162, 164): in fragment M 595a+, dang, ‘a small coin’, followed by idafa is written 
<d’ngyh>, with the abstract suffix -i (spelled -yh) representing a probably long idafa. 
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Persian, the so-called dari or parsi-yi dan. 7 Indeed, in the extant New Persian 
documentation in Manichaean script, mainly originating from Samarkand and 
eastern Iran between the 10 th and 11 th centuries, there is no trace of it; and it is 
rarely attested in the ancient Judaeo-Persian texts from north-eastern Iran (Paul 
2003,183; Shaked 2009, 452). It is possible therefore that the preposition o, shor¬ 
tened in New Persian and not represented in Arabo-Persian orthography, was 
gradually replaced in common usage by other more fortunate prepositions, as 
the polysemous ba ‘in, to, on, by, with, for’; and that it was then expunged from 
literary New Persian, though surviving in many Persian dialects especially in the 
western part of the Iranian plateau (Browne 1895; Filippone 2011,198). However, 
this preposition seems to have been transmitted as a legacy to the common lan¬ 
guage based on literary New Persian. An indirect trace of it - no longer written 
nor pronounced - can be detected in a number of cases, typical of the contem¬ 
porary spoken language (ex. miram xune ‘I am going home’, instead of be xane 
miravam), where a preposition is lacking or has fallen away (Lazard 1986, 252; 
Lazard 1990,189). 

The same can be said of the use of the idafa particle as a relative pronoun, 
well-known from New Persian texts in scripts different from Arabic. In this case 
too, more modern forms replaced the outcome of the old Middle Persian relative 
pronoun f; but scattered traces of it can be detected in literary Early New Persian 
texts (Lazard 1963,490-491, §§ 855-856), as well as in the modern spoken variety 
(Lazard 1990,188-189). 

Arabo-Persian orthography does not distinguish between some originally dif¬ 
ferent morphemes: the Middle Persian conjunction ku ‘that, than’, the relative/ 
interrogative pronoun ke ‘who, which’, and the temporal/conditional conjunc¬ 
tion ka ‘when, if’, all three merged in what seems to be a single new form ki, 
spelled <ky> or <kh>, or simply <k> joined to a preceding pronoun (<’nk> an-ki 
‘that which’) or to a following word beginning with a vowel (<k’md> k-amad ‘who 
came’) in early Arabo-Persian orthography. 8 These morphemes were still distinct 


7 For the meaning of these glottonyms, and the distinction between parsi and dari, see the 
studies by Gilbert Lazard reprinted in the book La formation de la langue persane, Paris 1995. 
Roughly speaking, parsi means generically ‘the Persian language’, while dari, ‘(language) of the 
court’, indicates - at the beginning of the Islamic era - the north-eastern variety from which 
literary New Persian sprang. 

8 Sams-i Qays (1981, 249) says that both spellings, with final <y> or <h>, are only a graphic de¬ 
vice intended to represent a preceding short i vowel ( kasra-yi ma-qabl). When ki is used with an 
interrogative and abstract (istifham-i mujarrad ) value, as in the expression 6 Id-st? ‘Who is he?’, 
however, Sams-i Qays recommends the spelling with final <y> which, he says, ‘is well perceptible 
in pronunciation too’. Perhaps he meant that, when this form had the value of an interrogative 
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- at least graphically - in some of the most ancient Judaeo-Persian texts, but 
often appear to be used interchangeably as far as their function is concerned; and 
ka ‘when, if’, spelled <k’>, has been retained only in the text of religious contro¬ 
versy referred to as Argument, published by Mackenzie in 1968 (Paul 2013, 151— 
152, § 185c, and 168-169, § 207). Manichaean New Persian too, despite its adhe¬ 
rence to the graphic tradition of Middle Iranian languages, especially Sogdian 
and Middle Persian, in Manichaean script, shows some traces of confusion (see 
de Blois 2006,106, s.v. k’; Provasi 2011,166-167, s.w. k’, kw, ky). This means that 

- during the formative period of Arabo-Persian orthography - such morphemes 
were losing, or had already lost, their distinct meaning due to a possible formal 
coalescence caused by the shortening of the vowels (kii > ku, ke > ki/ke). This pro¬ 
bably helped their merging into one single form. 

Some of the earliest Judaeo-Persian documents - from both south-western 
Iran (as for example Argument; but not the two dated documents referred to as 
Group C below) and Central Asia (for example the two letters from Dandan Uiliq 
in Central Asia: Group B below) - give evidence of the existence of two diffe¬ 
rent prepositions, corresponding to the sole Arabo-Persian preposition ba(d) 
written <b-> (or <bd-> before a vowel): 9 pa(d) from Middle Persian pad ‘to, at, 
in, on’, spelled <pd>, <p’>, or <p-> joined to the following word; and <by>, to be 
probably read be, with a directional value ‘to, towards’. These two prepositions 
are also attested - respectively spelled <b-> and <by> and with the same distinct 
meaning - in an interlinear translation of the Qur’an, in Arabic script, written in 
a particular variety of New Persian rich in dialectal features attributable to Sistan 
(south-eastern Iran). The manuscript of this text, referred to as Qur’an-i Quds and 
dated by Lazard to the end of the 11 th century (1990,188,192), has been edited, in 
facsimile and in a diplomatic edition, by RivaqI (1985). 10 

As to the genesis of the Arabo-Persian preposition ba(d), apart from a possi¬ 
ble coalescence between the two prepositions pa(d) and be mainly evidenced by 
pa(d) also acquiring a directional value, 11 contamination with the almost syno¬ 
nymous Arabic preposition bi- ‘with, in, by, at, near’ can also be cited, at least as 


pronoun, it was still articulated with a long e or i, as it is today. On the form of this/these 
morpheme(s) in the most ancient New Persian prose texts in Arabic script see Lazard 1963, 237, § 
253 (pronoun); 473, § 809 (conjunction). 

9 Lazard 1963, § 145, p. 191 shows that vocalised manuscripts also attest an occasional bi or bu 
(in labial context) pronunciation. 

10 A clear study of the repartition of the two prepositions’ usages has been given by Lazard 
(1986, 245-247 and 1990,187-188). 

11 Paul (2003,179-185, and especially the table on p. 185) shows that it is only in New Persian 
that ancient pa(d) also acquired a directional meaning. 
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far as its spelling with initial <b-> is concerned (de Blois 2006,109, n. 8). 12 Indeed, 
the Arabo-Persian preposition is spelled with initial <b-> - instead of <p-> attes¬ 
ted by coeval texts in scripts different from Arabic - even in the most ancient 
manuscripts which make at least occasional use of the four ‘Persian’ letters. The 
Manichaean New Persian fragment M 595a+ edited by Provasi shows a curious 
inverse spelling for the verbal prefix bi-, written <pd> as the preposition (Provasi 
2011, 161-162, 166). This might indicate that the rather inaccurate scribe of this 
fragment confused the two morphemes, perceiving them as homophonous (at 
least as regards their initial consonant); or that he was influenced by the Arabo- 
Persian orthography, in which the verbal prefix and the preposition were both 
spelled <b->. 


2.3 The rule of dal and dal 

As seen above, a clear normalizing aim seems to have presided over the esta¬ 
blishment and development of Arabo-Persian orthography. This ended up with 
the exclusion of all forms considered too dialectal or colloquial from writing, and 
therefore from literary New Persian; and with the simplification of a number of 
forms which had lost their distinct original form or meaning. 

Among the normalizing choices connected with the development of early 
Arabo-Persian orthography there is - to my mind - the so-called ‘rule of dal and 
dal’ well-known from the most ancient literary manuscripts. According to this 
rule, all ds after a vowel or a diphthong, within the same morpheme, were spelled 
<d>, i.e. as an interdental voiced fricative (ex. <pdr> piSar ‘father’); instead, after 
a consonant or at the beginning of a word (ex. <drd> dard ‘ache’) or after mor¬ 
pheme boundary (ex. <bd’n> bi-dan ‘know’), they continued to be spelled <d>. 
The complementary distribution of d and S implied by this rule concerned only 
the words of Persian (or Iranian) origin and the loanwords, such as those from 
Greek, entered into Persian at an ancient date. By contrast, the development of S 
from postvocalic d did not concern Arabic loanwords, probably because in Arabic 
/d/ and /d/ were (and are) two distinct phonemes. As manuscripts follow this 
orthographic usage approximately until the middle of the 13 th century, scholars 
have supposed that, around that time, the complementary distribution of the 
outcome of historical /d/ in Persian words, as d or as S, came to a halt and all 


12 Martin Schwartz (personal communication) suggested another possible explanation for 
pa(d) > ba(d) in New Persian: alignment with the initial b- of many other prepositions ( abar , 
abag, abe, abaz ), that by then had lost their initial vowel. 
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‘Persian’ 5s (but not Arabic ones) became d again, as they are today, with the 
exception of a small number of words which retained S - afterwards pronounced 
z - from historical d: guSastan ‘to pass’, guSastan ‘to put’, paSiruftan ‘to receive’, 
and a few others. 

To explain this change in the manuscript tradition, Pisowicz (1985,109) for¬ 
mulated another hypothesis, namely that from the second half of the 13 th century 
the allophonic development of the ancient postvocalic d to S - which he conside¬ 
red characteristic of central and western dialects - was ousted from the literary 
language due to the influence of north-eastern dialects, where this phenomenon 
had not occurred. Indeed, in a famous passage from the Mu'jam, Sams-i Qays 
says that ‘in the language of the people of Ghaznin, Balkh, and Transoxiana there 
are no dais, all of them being pronounced as the letter without points [i.e. dal]' 
(Sams-i Qays 1981,221); this - according to Sams-i Qays - would explain a number 
of irregular rhymes, which do not comply with the distinction between dal and 
dal, in the works of poets coming from these regions. Meier (1981,104), however, 
had already shown that the spirantisation of postvocalic d was not unknown to 
the north-eastern dialects, considered as a whole; according to him the boundary 
between the areas affected and not affected by this phenomenon passed between 
Marv in Khorasan (spirantisation), and Bukhara in Transoxiana (without spiran¬ 
tisation). Pisowicz’s hypothesis appears problematic also because the influence 
of north-eastern dialects on the literary language was certainly relevant from the 
very beginning of the history of literary New Persian; therefore it is unlikely that 
such an influence had had consequences only in the 13 th century. 13 When Sams-i 
Qays says that ‘in the language of the people of Ghaznin, Balkh, and Transoxiana 
[i.e. Afghanistan and Central Asia] there are no dais’, he is simply noting that 
not all north-eastern dialects have a spirant allophone of /d/, this causing some 
difficulties for the poets coming from these regions; but he is not stating anything 
about the complementary distribution of d and S implied by the orthography of 
the most ancient manuscripts. 

Recently Filippone, in the context of a study of the language of the Qur’an-i 
Quds, where no instances of spirantisation of postvocalic /d/ in words of Persian 
origin are to be found, has conducted research on Persian dialects to find out 
the dialect or the dialects from which the complementary distribution of d and 
S represented in literary ancient manuscripts could have originated. She conclu¬ 
des: ‘In analyzing these phenomena, I think that one should avoid reconstructing 


13 Recent summaries of this complicated question have been given by de Blois 2006, 94-96; 
Orsatti 2007, 94-98 (with translation of the relevant passage from Sams-i Qays); Filippone 2011, 
185-186. 
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highly standardised realities, with a homogeneity in time and place which has 
probably never existed. The tendency to a certain (contextual or free) variability 
of d/S/z appears as a constant factor throughout the history of West Iranian. [...] 
But cases of d/z fluctuation are mostly bound to single words. In many cases they 
remain unexplained’ (Filippone 2013,186). This is exactly the situation of modern 
standard New Persian, where dal (pronounced z) from historical postvocalic /d/ 
is only found in a limited number of words. 

At this point, a possible solution would be to consider the complementary 
distribution of dal and dal attested in early literary manuscripts as the result 
of the application of a (mainly orthographic) rule intended to set order into 
the multiform realisations of /d/ in the spoken or dialectal varieties of Persian. 
Sams-i Qays, in the previously mentioned passage, attributes the complemen¬ 
tary distribution of dal and dal to the ‘correct dan language’, that is to literary 
New Persian, and not to Persian tout court. Throughout Chapter Two of Section 
Two of the Mujam, devoted to ‘The letters of the rhyme’ (where - concerning 
the rhymes of the letter dal - the passage in question is to be found), he distin¬ 
guishes carefully the Persian language, called parsi or lugat-i parsi, from the 
lugat-i dart, ‘the dart language’, often qualified as sahih ‘correct’. For example, 
he says that in the Persian language {parsi) most words end with a quiescent 
letter, i.e. - roughly speaking - in a consonant (Sams-i Qays 1981, 209); and 
that among the peculiarities of the Persian language {lugat-i pars!), there is the 
fact that the clause is not complete without the copula (Sams-i Qays 1981, 215). 
When he notes that in Persian the final long -a is normally shortened, he refers 
this linguistic notation to the colloquial form of Persian {muhawarat-i pars f) 
(Sams-i Qays 1981, 211). When the author uses the term darl, instead, he refers 
to a rule or a canon, often even complaining of the lack of a clear criterion on 
which to rely (Sams-i Qays 1981, 205). 

That the rigorously complementary distribution of d and S was a phenome¬ 
non only affecting the literary language is proved by the fact that it seems to 
be unknown in New Persian texts of non-literary character. Neither the ancient 
New Persian texts in Hebrew script coming from south-western Iran (see Group 
C below), nor the non-literary Qur’an-i Quds in Arabic script, from Sistan, bear 
any trace of a spirant pronunciation of postvocalic /d/ (Lazard 1995, 136), and 
even less of a complementary distribution of d and S. Among the New Persian 
texts written in scripts different from Arabic only one shows a clear complemen¬ 
tary distribution of d and 8: it is the fragment of the bilingual (Syriac, and New 
Persian in Syriac script) Psalter from Central Asia (edited by Muller 1915, Sun- 
dermann 1974, and Sims-Williams 2011). Sundermann, however, ascribes the 
orthographic usage in this manuscript to the influence from the coeval Arabo- 
Persian orthography (1974, 450). The Manichaean New Persian orthography, 
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though having at its disposal a distinct letter <8>, attests only occasional ren¬ 
derings of Persian postvocalic /d/ as S in <’ry8> araS ‘he brings’ and <nbw8m> 
nabuSam ‘I was not’ in the fragment of the qasida published by Henning (1962, 
11. 10 and 34); and in <gw8r’ndg> guSaranda ‘passing’ in the so-called ‘Cate¬ 
chism’ ( Lehrtext ) published by Sundermann (2003, c2, where this form occurs 
twice, this verb being one of the few Persian words that have retained the letter 
dal till now): a total of only three words. 

Therefore, the complementary distribution of d and S should probably not 
be considered as a genuine linguistic phenomenon originating from one or more 
Persian dialects and hence entering into the literary language, but as a rule - in 
fact, ancient authors often speak of ‘rule’ - that was supposed to be applied to 
literary New Persian {dari) and had its main scope and field of application in 
orthography and in the scholastic and artificial pronunciation characteristic of 
poetry (see also Section 2.5). Probably around the mid-13 th century such a rule 
ceased to be extensively applied in the copying of literary manuscripts. 


2.4 The most ancient New Persian texts in Arabic script: the 
Codex Vindobonensis and the marriage contract from 
Bamiyan 

One of the most ancient dated Persian literary manuscripts is the Codex Vindo¬ 
bonensis (cod. A.F. 340 of the Osterreichische Nationalbibliothek in Vienna), 
containing a pharmacological tractate by Abu Mansur Muwaffaq b. ‘All al-Hirawi 
(therefore from Herat in modern Afghanistan), copied by the poet Asad! of Tus in 
Sawwal 447 / 24 December 1055 - 21 January 1056 (facsimile editions: Muwaffaq 
1972, 2009). 14 A relatively short span of time separates the composition of the 
work, which can be dated to the second half or end of the 4 th /10 th century, from 
this copy. Both the author and the copyist were from Khorasan, in the east of the 
historical Iranian territory. 

The orthography of the manuscript is not too different from today’s. The 
Arabic loanwords are spelled as in Arabic, though some orthographic usages of 
Arabic are not retained: a principle which has presided over Arabo-Persian ortho¬ 
graphy over the years. The ta marbuta alternates with the ta tawila Tong fd’, both 
read -at, or with final <h>, read -a. In particular, taking into consideration the 
second double-page of the manuscript (fols 2v-3r), the ta marbuta is used in jihat 


14 For a critical evaluation of the manuscript for linguistic studies and a bibliography until then 
cf. Lazard 1963, 45-48, N. 4. 
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Fig. 1: Codex Vindobonensis, fols lv-2r (from Muwaffaq 1972). 
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Fig- 2: Codex Vindobonensis, fols 2v-3r (from Muwaffaq 1972). 
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(2v 9), sa'adat (2v 10), hadrat (3r 1 and - probably with loss of the two points - 2v 
5), and ziyadat (3r 1); the ta tawila is found in quvvat (2v 1, followed by the plural 
suffix -ha), in quvvat u madarrat u manfa'at-as (3r 5, before the suffix pronoun 
-as), and in qismat (3r 7); instead, the writing with final <h>, certainly read -a 
(in fact, it is followed by the <y> indicating the idafa after a vowel), is found in 
two Arabic words: xizana (3r 2) and in daraja (3r, 5, 8-12). Hamza is not written 
in muayyad (2v 5), ta’ammul (2v 7) and ta’lif(3r 2). The latter spelling could actu¬ 
ally represent a linguistic feature of the language of the text: the dropping of the 
glottal stop with compensatory lengthening of the preceding vowel (Meier 1981, 
128,133-134). 

In this manuscript, final <y> - besides its normal form(s) - also has a small 
form, mostly used to represent the idafa after a word ending in -a (see lv 4,5; 2r 2; 
3r 2,5,8-12). This small yd can be considered as the origin of the ‘Persian’ hamza, 
i.e. a hamza placed over or by a final <h> indicating -a. In fact, in fol. lv, a second 
hand seems to have begun to replace these Tittle yds’ with more modern hamzas 
(11. 4 and 5). 

The manuscript is fully vocalised and provided with orthographic signs, 
and complies with the ‘rule of dal and dal’. As to the so-called majhul ‘unknown’ 
vowels, i.e. the vowels e and o unknown to the Arabs, a vocalisation for e different 
from i is well attested (Meier 1981, 86-87), whereas I found no examples for o. 
A massive use is also made of the distinctive signs (additional diacritical points 
and little letters written above or below the main letters) intended, in addition to 
the usual diacritic points, to differentiate letters of the same form: 15 three points 
under sin and /cd/to differentiate them from sin and gaf; one point under ra, dal, 
sad and ta, to differentiate them respectively from za, dal, dad and za; a letter 
underwritten, to differentiate ‘ayn from gayn and ha from xa. Final <y> has often 
two subscribed points, as in Arabic writing, a graphic usage afterwards aban¬ 
doned in Arabo-Persian orthography. 

Among the other more relevant orthographic usages of this manuscript there 
are the following: 

the ‘Persian’ letters <p>, <c> and <g> are occasionally used (I have found no 
instances of <z>), all three written with three points below, but often replaced 
respectively by <b>, <j>, <k>; 

letter -4 representing the spirant allophone of /b/ occurs quite regularly in 
verbal forms from aBzudan ‘to add’ and aBgandan ‘to throw’, and in the suffix 
- Bam ‘color’; 


15 Grohmann 1971, 42-46, § 4. 
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- the preposition ba- ‘in, to, by’ ( <pa/paS ) is always written with< b> - and not 
with the more ancient <p> - attached to the following word, a usage contin¬ 
ued until recent times (now discouraged in favor of the ‘separate’ spelling 
<bh>); 

in ‘this’ and an ‘that’ are often written, without initial alif, joined to a preced¬ 
ing preposition {az-in 2v 3; andar-in 3r 3) according to a usage frequent until 
very recent times; 

- the conjunction ‘that’/relative pronoun ‘who, that, which’ are always written 
<ky>; 

- the verbal durative prefix has the form hame and is always written as a stand¬ 
alone word; 

alif-madda is only occasionally used; initial a- is often written as a simple 
alif, or as two alifs next to one another. 

Apart from minor fluctuations continuing until recent times, this already norma¬ 
lised orthography is attested not only in the most ancient literary manuscripts, 
but also in the two most ancient private documents in Arabo-Persian: a marriage 
contract dated 470/1078 found in Bamiyan (Afghanistan), 16 and a deed for the 
sale of land found in Khotan, dated 501/1107. 17 

Among the orthographic features of the marriage contract, written only 
about 20 years after the Codex Vindobonensis and originating from Afghanistan, 
the following should be noticed: 

- the orthography of the text is not too different from modern orthography, 
with the exception that here diacritical points are often omitted; 

all Arabic loanwords maintain their original spelling; 

- the four ‘Persian’ letters are not used: cf. for ex. <sbyd’r> sapedar ‘white 
poplar’ (11. 2, 3,12) and <krftn> giriftan (1. 25); 

- the ‘rule of dal and dal’ is not respected; in particular, two Persian words 
which even today are spelled with dal are written with <d>: <bbdrfth> 
bipadrifta ‘he has accepted’ (1. 28), corresponding to literary New Persian 
bipadimfta and <kwdsth> gudasta ‘passed, elapsed’ (1. 31), corresponding to 
literary New Persian gudasta; 


16 Published in Latin transcription/transliteration by Scarcia (1966). The same author gave a 
study of this text together with a photographic reproduction of it in a previous article (Scarcia 
1963). 

17 This document was first studied by Margoliouth (1903), who also offers a photographic 
reproduction of it. Minorsky (1942) gave a transcription and translation of the text and a new 
study, also correcting Margoliouth’s reading of its date: not 401 but 501 of the Hegira. 
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in 1.1 the preposition bar ‘on’ is written <vr> or <fr> (it is not easily readable 
from the photograph published by Scarcia 1963), a spelling which probably 
represents a dialectal form; 

the verbal prefix me- (< hame), which always occurs in this more modern 
shortened form, is written attached to the following verbal form; 
the preposition ba ‘in, to, at’, is written as <b-> attached to the following 
word; 

the conjunction ki ‘that’/relative pronoun ki ‘who, which’ are both written 
<kh>; 

the plural suffix, always occurring in the form -hay, is written attached to 
the preceding word, except after words written with final <y>; e.g. <sr’y h’y> 
sarayhay ‘houses’, 11.18, 25 and passim (there are no plural words ending in 
<h>); 

alif-madda is never used; 

final <y> has, though very rarely, two points below, thus indicating that the 
‘Arabic’ writing of final <y> with two points below, later abandoned, was still 
in use; 

the coordinative conjunction u/wa ‘and’ is not written, apart from in eight 
cases (11. 24, 25, 27, 28 twice, 29, 31 twice), mostly placed at the beginning of 
a new sentence. 

The latter feature is very interesting and represents the sole real divergence from 
the orthography of literary texts (and from modern orthography). It seems to 
suggest that in the manuscript of the marriage contract the short u of the conjunc¬ 
tion was not written, except when it was at the beginning of a sentence or after 
a pause. We can suppose that precisely at the beginning of a sentence or after a 
pause the Persian conjunction u (< Middle Persian u, lid) begun to be pronounced 
wa, as in Arabic. These, therefore, would be early attestations of the new form, 
probably influenced by the Arabic conjunction wa, of the Persian conjunction, 
even now pronounced va or o according to its syntactic position and elocution 
speed. 

Ancient literary manuscripts too - less carefully copied than the Codex 
Vindobonensis - show some traces of a failure to write the coordinative conjunc¬ 
tion. For example, in the ancient fragment of ‘Unsuri’s poem Wamiq wa Azra 
(datable to the 11 th —12 th centuries) the conjunction is occasionally not written and 
has been integrated into the edition (Hagg and Utas 2003, 79). 

Considering that the two ancient manuscripts analyzed here, that is the 
Codex Vindobonensis and the marriage contract, pertain to different textual 
typologies - a beautiful copy of a scientific-literary text and a legal private docu¬ 
ment - the orthographic differences between them are not so great. The lack of 
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vocalisation and of any orthographic sign in the marriage contract, as well as the 
defective diacritical pointing of the letters, are clearly to be connected with the 
practical scope of the document, devoid of any aesthetic pretension, and with 
the formulaic character of the text, written in a highly standardised language. 
The comparison between the orthography of the two texts shows a high degree of 
normalisation from early times, at least as far as Eastern Iran and (today) Afgha¬ 
nistan are concerned. 

In this regard one fact is particularly meaningful: in the two most ancient 
non-literary documents, i.e. the marriage contract from Bamiyan and the deed 
for the sale of land from Khotan, the Persian word pansad ‘five hundred’ (Scarcia 
1963,11. 3, 22; Margoliouth 1903,1.12) is already written with Arabic <s>, as it is 
now - sad ‘hundred’ being one of the few Persian words written with this ‘only 
Arabic’ letter. The reason for this spelling is perhaps to avoid confusion with 
other homophonous words such as sad(d) ‘rampart, obstruction’ or - as Perry 
(2002) suggests - with the very common word sud ‘(he) became’ which, due to 
the frequent defective writing of the diacritical points, could be confused with 
the word for ‘hundred’. The fact that both non-literary documents already offer 
this ‘normalized’ spelling is a clear proof of the high degree of standardisation of 
Arabo-Persian orthography from ancient times. 


2.5 The spelling of the Arabic loanwords 

As we have seen, one of the most striking features of Arabo-Persian orthography 
since its beginning is the preservation of the original spelling of the Arabic words 
which entered into Persian, though we can suppose that - once established in 
the Persian language - the Arabic loanwords were pronounced, as they are today, 
according to Persian phonology. The only exception are the introduction of the 
new phoneme /q/, in Early and Classical New Persian still clearly distinguished 
from /y/ <g> (Pisowicz 1985, 111-117) and, perhaps, the pronunciation of Arabic 
<d>: indeed, given the existence, in words of Iranian origin, of a fricative postvo¬ 
calic allophone of historical /d/, the letter dal could well have been pronounced, 
in Arabic loans, as an interdental voiced fricative (de Blois 2006, 94). 

The preservation of the original orthography of the Arabic loanwords is cer¬ 
tainly a consequence of their scholarly origin: the Arabic loanwords entered the 
Persian language mainly from books, from the written Arabic language; and only 
gradually, by ‘osmosis from above’ (Bausani 1978,13-14), did they penetrate into 
the everyday language. It was mainly the Persian (or Iranian) bilingual scholars, 
who knew and used Arabic as a scholarly language (as was Latin for European 
scholars), who were responsible for the introduction of a great quantity of Arabic 
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learned vocabulary into their works. If the spelling of the Arabic loanwords in 
Persian has to be taken as referring to real pronunciation, and not only to the 
written form of the words, it probably represents mainly a scholarly or literary 
pronunciation of Classical Arabic, i.e. an artificial and altogether scholastic 
pronunciation. The same artificial pronunciation of Arabic loanwords is cha¬ 
racteristic of the metrical reading of poetry. In Persian poetry, for example, the 
Arabic ‘ayn is always ‘pronounced’, i.e. counts as a consonant, even in positions 
where, in normal speech, it is (and probably was) never pronounced. Likewise, 
the hamza (glottal stop) of Arabic words is counted as a consonant; and this can 
happen even at the beginning of a word, before a vowel, when the hamza is and 
was neither written nor pronounced. Moreover, <z>, <d>, <d> and <z> never rhyme 
together, nor can <s> rhyme with <t> or with <s>, <h> with <h>, or <’> with <’> 
(Meier 1981,103). This seems to point to a sort of artificial and scholarly pronun¬ 
ciation of Arabic loanwords in Persian poetry. 

The preservation of the original Arabic orthography of loanwords has an 
important implication (and was also probably dictated by this need): it makes 
the Arabic loanwords immediately recognisable, without destroying the kinship 
between words pertaining to one and the same Arabic root. A kind of conscious¬ 
ness of the original written form of the Arabic lexicon within Persian has always 
been maintained, and is proved not only by the fact that Arabic words can be 
uttered (in poetry or in scholarly contexts) with a literary pronunciation, approxi¬ 
mating to that of Classical Arabic, but also by the fact that even now, in standard 
New Persian, the intervocalic glottal stop /’/ can be replaced by the glide y only 
if it is represented in the original writing by a hamza, but never if it is represen¬ 
ted by the homophonous (in Persian) letter 'ayn: for ex. /la’eq/ ‘worthy’ can be 
uttered as [la’eq], [layeq] or [laeq]; but /sa’er/ ‘poet’, from Arabic sa'ir, cannot be 
pronounced *sayer, given the origin of /’/ from an Arabic 'ayn in this word (Piso- 
wicz 1985, 20,102). 

The orthography of Arabic loanwords in Persian has remained virtually 
unchanged throughout the entire history of the New Persian written tradition, 
remaining impermeable to any influence from the different diachronic and dia¬ 
lectal varieties of Arabic (a relevant exception is represented by the Arabic loan¬ 
words ending in ta marbuta, on which cf. Perry 1991 and 1995). 

An accurate preservation of the original Arabic spelling for Arabic loanwords 
is to be found not only in Arabo-Persian orthography, 18 but also - where possi- 


18 One exception is the Persian orthography of words with hamza and their phonetic realisation, on 
which thorough historic-linguistic research is still lacking, apart from useful remarks in the excellent 
work on the history of the Middle and New Persian phonology by Pisowicz (1985,20,47-51,102). 
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ble - in the orthography of the New Persian texts in scripts different from Arabic. 
Indeed, for the redaction of New Persian texts, with their rich Arabic lexicon, the 
Hebrew alphabet had the possibility of transliterating many of the ‘only Arabic’ 
letters: het, as opposed to he, was used to transliterate Arabic ha; tet, as opposed 
to taw, was used for Ar. ta; 'ayin was available for Ar. 'ayn; sade, which in the 
adaptation of Aramaic-based scripts to the Iranian languages had already been 
employed for Iranian c (Skjaervo 1996, 516), and partially for /, was of course 
also suited to represent Ar. sad; and qoph was available for Ar. qaf. Moreover, 
for some of the Arabic and Persian sounds not represented in the Hebrew alpha¬ 
bet, the Judaeo-Persian writing system resorted to the possibility of representing 
the Hebrew spirant allophones of the plosives by means of diacritic signs: hence 
taw was used to represent both /t/ and, with or without diacritics, the letter ta 
of the Arabic loans; dalet was used for /d/ and, with or without diacritics, for 
Ar. dal; kaph for /k/ and for /x/ (in both Arabic and Persian words); pe was used 
for Persian /p/ and for Arabic and Persian /f/. The only Arabic letters which had 
no possible graphic equivalents in the Hebrew alphabet were dad and za, for 
which - as well as for Persian and Arabic /]/ - a series of different solutions were 
adopted (amply described by Paul 2013, 30-33, §§ 11-12). 

For Persian texts in Syriac script, too, the ‘only Arabic’ letters were easily 
transliterated by means of the corresponding letters <h t' s q> of the Syriac alpha¬ 
bet. The Syriac letters <t> and <d>, which could also represent, with a point under 
the letter, the spirant allophones of /t/ and /d/, were also used to transliterate 
the Arabic letters ta and dal; likewise <k>, <p> and <g> with a point below were 
used to represent Arabic and Persian /x/, /f/ and /y/. On the model of Arabo- 
Persian orthography, however, in some texts /f/ was represented by <p> with a 
point above, instead of below. For Arabic <z>, for which no letter was at hand, 
some Syro-Persian texts used <t> with a dot above, again a clear caique of the 
Arabic letter za. 

The same can be said for the New Persian texts in Manichaean script. By 
the time Manichaean script was being adapted to New Persian (to be tentatively 
placed at the end of the 9 th - first half of the 10 th century), six new letters had 
already been added to the original 22 of the Aramaic alphabet to write other 
Iranian languages, and were already present in Manichaean texts in Sogdian: <6 
y 8 f j x>. For the Arabic words entering New Persian, therefore, the letter <8> 
was used to transliterate dal; and the same letter, single or more often doubled, 
<88>, was used to transliterate ta. Two new letters introduced by punctuation 
(<k> and <q> with two dots above) - already used in Turkish texts in Manichaean 
script - were used in Manichaean New Persian to transliterate qaf, given that 
simple <q> (without dots) had already been used (as an alternative to <k>) to rep¬ 
resent k. Moreover <’> with two dots above was introduced to represent 'ayn in 
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Arabic words, because in Manichaean orthography <'> had already been used 
to represent an initial palatal vowel. For <d> of Arabic words different solutions 
were adopted: <z>, as in <z’wbt> for dabit ‘commander; chaste’ in the ‘Catechism’ 
edited by Sundermann (2003, cl9; a different reading for this word is suggested 
by de Blois 2006, 114, s.v.; see also Shokri-Foumeshi 2014, 202-203), and <d> in 
<hawwd> hawd ‘basin, cistern’ (Sundermann 2003: el8). 19 On the other hand, the 
‘Arabic’ letters <t>, <s> and <h> could not be transliterated, because the corre¬ 
sponding letters of the original Aramaic alphabet had already been used to note 
sounds of the Iranian languages. 20 Indeed, Manichaean orthography was already 
so loaded with graphic habits fixed for other Iranian languages that it became 
impossible to render all the ‘only Arabic’ letters into this script. The creation of 
new letters had in the meantime come to an end, soon to be followed by the dis¬ 
appearance of the Manichaean religion from the pages of history. 21 


3 New Persian texts in scripts different from 
Arabic 

Let us now look at the most ancient New Persian texts in non-Arabic scripts 
- which cover exactly the period for which we have no original documents in 
Arabo-Persian - in the hope of discovering some indirect evidence about time 
and place of the formation of the Arabo-Persian orthographic canon. 


3.1 Judaeo-Persian texts 

Some Judaeo-Persian texts, that is Persian texts written by means of the Hebrew 
alphabet, are among the most ancient written New Persian documents. The 
Jewish minorities living in Iran spoke Persian, or one of the various Persian 


19 The latter spelling is interpreted by de Blois (2006, 96) as being dictated by ‘the ‘Persian’ 
convention of representing a postvocalic interdental as d rather as 8’, exceptionally applied also 
to an Arabic word. Filippone (2011, 186), instead, thinks that this spelling reflects a dialectal 
pronunciation. 

20 No loanword with Arabic <z> is attested in Manichaean documents published up till now: cf. 
the glossaries by de Blois 2006 and Provasi 2011, 163-168. A glossary of all Arabic loanwords in 
Manichaean New Persian texts has been published by Shokri-Foumeshi 2014. 

21 On the adaptation of the Manichaean script to the Iranian languages cf. Henning 1958,73-75; 
and for writing New Persian, cf. Henning 1962, 89-91, Orsatti 2007,150-164. 
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(or Iranian) dialects spread throughout the Iranian linguistic area, as their mother 
tongue (Yarshater 1974); and, for written purposes, they used Persian (more or 
less tinted with dialectal features) written in Hebrew characters. In the past, scho¬ 
lars thought that the texts emanating from Jewish minorities revealed a Persian 
dialect different from the language of their Muslim neighbors. Recent studies, 
however, especially since the discovery and publication of the manuscript of the 
Persian dialectal translation of the Qur’an known as Qur’an-i Quds (see above), 
have shown that this is not so, apart - of course - from the presence in Judaeo- 
Persian texts of some Hebrew loanwords and expressions. A number of linguistic 
features known until then only from Judaeo-Persian texts were also found in this 
Muslim text (Lazard 1990). In general, the Judaeo-Persian texts, as well as the 
New Persian texts in Syriac and Manichaean scripts, reflect written varieties of 
New Persian differing from literary New Persian. Recent studies, moreover, have 
focused on a difficult task: analyzing the dialectal variations within the Judaeo- 
Persian and other written traditions of Iran. 22 

We do not know when the Hebrew alphabet was adapted to write Persian. 23 
As only texts dated or datable to the Islamic period are extant, it is generally sup¬ 
posed that the adaptation of the Hebrew alphabet to the Persian language occur¬ 
red only in Islamic times. But some scholars think that a Judaeo-Persian literature 
(in particular translations of the Bible into Persian) in Hebrew characters must 
have already existed in the Sasanid period (Bacher 1904). An answer to this ques¬ 
tion would be essential in order to ascertain the linguistic value of Judaeo-Persian 
orthography and the possible presence of historical spellings. In what follows 
I will present the main orthographic characters of some Judaeo-Persian texts, 
grouped according to chronological and geographic criteria. 

Group A. The most ancient dated Judaeo-Persian documents are three short 
inscriptions carved on a rock in a mountainous passage, Tang-i Azao, in Western 
Afghanistan, left by three merchants bearing Jewish names, who were coming 
from Koban, the ancient name of the Qabul valley. These inscriptions (edited by 
Henning 1957) are dated, according to the Seleucid era, to 1064, corresponding to 
752 CE. 24 They are all very short; no Arabic word is attested. 


22 On Judaeo-Persian dialectology, after the groundbreaking article by Lazard 1968, see Shaked 
2009 and Lazard 2014. 

23 For a thorough analysis of the adaptation of the Hebrew alphabet to write Persian, cf. Paul 
2013, 23-48. 

24 Rapp (1967, 55-56) has unconvincingly questioned the dating proposed by Henning (1957, 
338), proposing a much later date: 1299-1300 CE. 
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The main orthographic features of these inscriptions are the following: 
the idafa particle is written with a yod attached to the following word, unlike 
the Arabo-Persian orthography: <(y)’r y’wy> yar-i by ‘his Friend’ (C3); 
the suffix pronoun of the 3 rd singular person -as is written separated from the 
preceding word, i.e. with initial ’ aleph : <y’r ’§ ’w b’d> yar-as 6 bad ‘May He be 
his helper’ (A3, B2-3); 

the letter qoph of the Hebrew alphabet is used to represent the Persian sound 
/k/: <qnd> kand ‘(he) incised’ (A2, B2), <qy> ki ‘who’ (A2). 

The use of <q> to represent /k/ implies that the Hebrew kaph could be left to repre¬ 
sent the voiceless uvular spirant /x/ of Persian (no occurrence of x is to be found 
in these short inscriptions, however), for which sound no letter of the Hebrew 
alphabet was at hand. This can be taken as evidence that, in all probability, 
Arabic words had not entered the Persian language yet, or at least not the lan¬ 
guage represented by these inscriptions (Lazard 1968, 82). 

Group B. Chronologically, after the inscriptions of Tang-i Azao there follow two 
letters discovered at Dandan Uiliq (Central Asia, northeast of the Khotan oasis), 
and referred to as DU1 and DU2, datable to the second half of the 8 th century. 
Of the first one, a fragment of a commercial letter, a continuous reading cannot 
be given, as the left and right margins of the sheet have been badly damaged. 25 
The other, also coming from the same area and certainly written in Khotan, has 
recently been published by Zhang and Shi (2008) with a study in Chinese (which 
I have not been able to read). Both represent the same language and the same 
orthographic usage. 

The main orthographic features of these letters are: 

1. <q> represents Persian /k/, and <k> represents /x/: <qrdwm> kardum ‘I made’ 
(DU1, 2), <kwdh> xudah/xudah ‘God’ (DU1 and DU2 passim); 

2. sometimes long d is not written: <yptwm> yaftum ‘I found’ (DU1,28), <sd hzr> 
sad hazar ‘hundred thousand’ (DU2,1), <kwhrq> xwaharak ‘sister’ (DU2, 3); 

3. the idafa particle/relative pronoun is variously represented: 

<’y>: <’z swy ’y mn> az soy-i man ‘from me’ (DU1,18), <’z swy ’y dwyd r’> 
az soy-i Dawid-ra ‘from David’ (DU2, 34) <q’r ’y prmwdy> kar-i farmudi 
‘the work which you ordered’ (DU1, 29, with idafa as relative pronoun); 


25 Published by Utas (1968), with a bibliography of the previous studies. Lazard (1988) has 
given a valuable contribution to the reading and interpretation of a number of passages from 
this text. 
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not written before a palatal vowel, with which it probably blended: 
<pnn’m yzyd> pannam(-i) izid/ezid ‘in the name of God’ (DU2, 1), <tn 
ysm’> tan(-i) isma ‘yourself’ (DU1,23), 26 <n’mh ysm’> nama(-i) isma ‘your 
letter’ (DU1, 28, 33), <mrdwm’n ysm’> mardumdn(-i) isma ‘your people’ 
(DU2, 3); 

<y->: <kwdh yqrbqr> xudah-i kirbakkar ‘the beneficent God’ (DU2,1); 

<-y> attached to a preceding demonstrative pronoun an ‘that’: <’ny 
an-i> ‘belonging to’ (DU1, 4, 13), <wbr kwndwm ’ny nbyst bwdy> u bar 
xwandum an-i nibist budi ‘and I read what you had written’ (DU2, 7, with 
-i relative pronoun and an old past participle, without final -a) 

4. the conjunction is written <w> attached to the following word: <swd wzy’n ’y 
man> sud u-ziyan-i man ‘my profit and loss’ (DU1,14), <bzwrg wqwdq> buzurg 
u kodak ‘young and old’ (DU2, 3); 

5. <s> is used for c and j: <swn> cun ‘as’ (DU1,13), <smh> Jama ‘clothing’ (DU1, 
10), <pns> pan) (DU2,14, 27); 

6. <by> representing the directional preposition he/hi is well attested: <by sm’> 
be sima (or be-sma with contraction?) ‘to you’ (DU2, 27); 

7. final -a is regularly written <h>: <prwkth> furoxta ‘sold’ (DU1, 10), <n’mh> 
nama ‘letter’ (DU1, 28, 32), except than in the monosyllables <p’> pa ‘in, to’ 
(DU1 and DU2 passim), <m’> ma- (verbal prohibitive prefix) <m’ kwr> maxwar 
‘do not suffer’ (DU1,30), <n’> (verbal negative prefix) <n’ d’nwm> nadanum ‘I 
do not know’ (DU2, 27), and in the pronoun isma/sima ; 

8. <’> is sometimes omitted before an initial vowel other than long or short a: 
<dwr by wpt’d> dur biyuftad ‘(it) was delayed’ (DU1, 7), <ydwn> edun ‘so’ 
(DU1, 22,24, 31), <yzyd> izid/ezid ‘God’ (DU1 and DU2passim); 27 

9. in the first letter (DU1) there are no Arabic loanwords, but in the second one 
(DU2) a few are attested. Their spelling does not reproduce the original Arabic 
writing: haldm ‘doctor’ is spelled <hqym> (Du2, 4,13), harb ‘war’ is spelled 
<hrb> (DU2, 33), without the Hebrew letter <h> being used to represent the 
Arabic emphatic h; 


26 In these texts the personal pronoun for the second person singular is probably to be read 
isma (also perhaps alternating with sima), a form occasionally attested in Early New Persian 
(Lazard 1988, 208). 

27 See the word <myd> umed ‘hope’ in the Qur’an-i Quds in Arabic script (Filippone 2011,190) 
and <mwd>, with dialectal u/u for e, in the Judaeo-Persian inscription A (1. 3) from Tang-i Azao 
(Henning 1957, 342 and n. 2). 
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10. perhaps the ancient directional preposition o (reduced to o/u or a and repre¬ 
sented by <’> attached to the following word) is represented in <’pys> a-pes 
‘near, before’ (DU2, 8). 

Therefore - except for the coincidence with the Arabo-Persian orthography in the 
spelling of final -a as <h> - the occasional defective spelling of a and of initial 
’aleph, the different ways the idafa particle is written, the use of letters <q> for /k/ 
and <k> for /x/ are all features representing an altogether different orthographic 
tradition, compared to the Arabo-Persian one. 

Group C. Two dated documents pertaining to the Ahvaz or Southwestern group 
of Judaeo-Persian documents (10 th - first half 11 th centuries) are considered here: 
a legal document dated 1262 of the Seleucid era/ 950 CE, edited by Shaked (1971); 
and a legal document dated Ahvaz (Khuzistan, south-western Iran) 1332 Seleu- 
cid/1020 CE. In the latter (edited by Asmussen 1965, and generally referred to as 
Law report of Ahvaz): 

1. <k> represents k, and <q> represents q of Arabic or Hebrew words, in contrast 
to the usage in the first two Judaeo-Persian texts already discussed: <knd> 
hand ‘he snatched’ (1.5), <qw’my> qiwami ‘right, lawful’ (1.12); 

2. the letter sade <s> is used for Persian c and/, but is also used to represent the 
Arabic letters dad and sad: 

<s> = c: <bs’> bac(c)a ‘baby’ (1.4) 

<s> = / in both Persian and Arabic words: <swml’> jumla ‘whole’ (1. 2), 
<pns> pan] ‘five’ (1.5), <swpt>/u/f ‘couple’ (1.5), <sw’b >fawab ‘answer’ (1. 
7); only once/is written <g>: <w’gyb> wajib ‘necessary’ (1. 8) 

<s> = s: <mysr> Misr ‘Egypt’ (1.5) (for Persian s, the letter samek <s> of the 
Hebrew alphabet is used: <s’l> sal ‘year’) 

<s> = d: <srwr’> darura(t), ‘need’ (1. 8), <r’sy> radl ‘content’ (11.10-11) 

3. <h> e <h> of the Arabic alphabet are carefully distinguished: <shwtwm> 
sahwat-um ‘my desire’ (1.5), <hwst> huffat ‘proof’ (1.15); 

4. <t> and <t> are carefully distinguished: <byst> bist ‘twenty’ (1. 5), <slt’ny> 
sultani ‘sultanial’ (1.12); 

5. the Arabic 'ayn <’> is always written: <'ws> 'iwad, 'awad ‘compensation’ (1.11); 

6. the idafa: is spelled as <y-> attached to the following word or is not written: 

<y->: <kwd’wnd’n ymylk> xudawandan-i milk ‘the owners of the property’ 
(1. 9), <'ws y’yn drh’> 'awad-i in durha ‘as recompense for these pearls’ 
(1. 11), <p’yn ykyrdy> p-in-i kirdi ‘in this that you have done’ (1. 7, with 
written idafa/ relative pronoun) 
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not written: <’ws ’yn dyn’r> 'awad-i in dinar ‘in place of these dinars’ 
(1. 12); <bd kyrdy ’yn kyrdy> bad kirdi in(-i) kirdi ‘you did wrong (what) 
you did’ (11. 6-7, with not written idafa/ relative pronoun); 

7. the directional preposition outcome of Middle Persian 6, already reduced to 
u/o or probably a, is spelled <’> attached to the following word: <’pys> a-pes 
‘near, before’ (11.1, 3, 9,10); it can also introduce a direct object; <’hsr’ kyrd 
’dny’l> ihdar kird a-Daniel ‘she cited Daniel’ (1. 3); 28 

8. the directional preposition <by> is not attested. 

Group D. Here only one text is considered, the Tafsir of Ezekiel, that is a trans¬ 
lation and commentary of the Book of Ezekiel. Its manuscript (ms. Firkowicz I 
1682 of the St Petersburg Public Library, edited by Gindin 2007) is datable to a 
period between the late 10 th and the early 11 th century. The so-called ‘Part 1’ of the 
manuscript (pp. 1-169 and 221-226) seems to represent a northeastern dialectal 
variety of New Persian probably originating from northeastern Iran or Afghanis¬ 
tan (Gindin 2007, 23-26). 29 In Part 1: 

1. the Arabic words are transliterated, and retain their original Arabic spelling: 

<t> (and not <s>, as in the Ahvaz document) is used for Arabic d (cf. Paul 
2013,33) 

<g>, or <g> with a stroke (and not <s>, as in the Ahvaz document) is used 
for J, in both Arabic and Persian words: <g'wd ’z ’w> jud az 6 ‘different 
from that’ (5.8, Gindin 35), <g'’myh’>jdmayihd ‘clothes’ (38.4, Gindin 73) 

2. the orthography of the idafa particle is the same as in the Arabo-Persian 
writing system, apart from rare cases in which it is written <-y> also after a 
word ending with a consonant: csrhy ’n ’ydr by krd> sarh-i an edar bikard ‘he 
explained it here’ (3.14, Gindin 33) 

3. final -a is written <h> mainly in past participles, but retains the spelling with 
final <’> in monosyllables and in many words: 

final -a is written <h>: <bwdh> buda ‘been’ (35.21, Gindin 69), 
final -a is written <’>: <hm’> hama ‘all’ {passim ), <s'yhr’> cihra ‘face’ (38.7, 
Gindin 73), <p’ zm’n’y msh> pa zamana-yi Moseh ‘at the time of Moses’ 
(5.10-11, Gindin 35) 


28 The same form <'hsr’>, probably to be read ihdar, is also found in the other legal document 
dated 950 (Shaked 1971,1.1). 

29 Recently Lazard (2014, 91-92) has instead argued that the language of this Tafsir has a north¬ 
western or central-northern origin. Quotations are given according to the page and line of the 
manuscript, followed by the page in Gindin’s edition. 
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monosyllables: <m’ kwn> makun ‘do not do’ (5.6, Gindin 35), <n’ kwnd> 
nakunad ‘does not do’ (5.9, Gindin 35) 

4. the defective writing of long a is very rare, not to say absent, and words are 
written as in Arabo-Persian. 

From this brief sketch it is possible to conclude that the orthography of the 
Judaeo-Persian texts taken into consideration seems to show a clear trend: as the 
language becomes richer in Arabic loanwords, orthography shows an effort to 
find the best way to transliterate them. The few Arabic loanwords attested in the 
second letter from Dandan Uiliq are still written as if they were Persian words, 
without distinguishing the ‘Arabic letters’ (Group B). Instead, the numerous 
Arabic loanwords in the Ahvaz legal document (Group C) are carefully transli¬ 
terated, despite the multiple values given to the letter sade <s>, which seems to 
represent the weak point in the writing system represented in this text. In this 
text, however, the word <w’gyb> wajib, written with <g> for ///, is interesting: it 
shows that for /// in the increasing number of Arabic words, a new spelling was 
gaining ground, perhaps taken from Judaeo-Arabic orthography. In the Tafsir of 
Ezekiel (Group D) the multiple values letter sade has in the Ahvaz legal docu¬ 
ment have been made less ambiguous by further differentiating dad of the Arabic 
words from sad, and Persian c from ) of both Arabic and Persian words. 

The first part of this Tafsir shows an influence from Arabo-Persian ortho¬ 
graphy. Indeed, about the way the Hebrew alphabet is used in this manuscript, 
David Neil Mackenzie wrote: ‘It is clearly based on a familiarity with the normal 
Arabo-Persian script, and indeed an excellent knowledge of Arabic’ (Mackenzie 
2003,103-104). If we consider that the Tafsir manuscript is more or less coeval or 
even older than the Law report of Ahvaz, it is possible to measure the importance 
that the north-eastern origin of this text has in regard to its orthography. 


3.2 Syro-Persian texts 

At this point I will speak very briefly about the Syro-Persian documentation, 
that is, New Persian texts in Syriac script, emanating from Christian milieus in 
Iran. Not many texts belong to this group because - in contrast to Persian Jews - 
Christians in Iran have very often used the Arabic script; it is interesting to note 
that, for example, the translations of the Bible into Persian in Hebrew milieus 
are written in Hebrew script, whereas the Persian Gospels are written in Arabic 
script. Christians even dated their manuscripts according to the Muslim era (see 
for example the manuscript of the Persian lectionary studied by Richard 1981). 
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Many Syro-Persian manuscripts are not dated and, even when they are 
datable with reasonable certainly, they have undergone a long transmission 
which may have exerted a normalizing effect on orthography; therefore they are 
not so useful for the purpose of giving indirect evidence on Arabo-Persian ‘par¬ 
allel’ orthography. 

Only the previously quoted fragment from the bilingual (Syriac and New 
Persian in Syriac script) Psalter from Central Asia is probably attested by an 
old manuscript. It shows a clear influence from Arabo-Persian orthography, in 
particular in the complementary distribution of d and 8 (see above). Therefore, 
again, a text from northeastern Iran gives - through its orthographic characteris¬ 
tics - good evidence of how the coeval Arabo-Persian orthography was by then 
well-established, and exerted a strong influence on the other graphic traditions 
of Iran. 


3.3 Manichaean New Persian texts 

For the purpose of writing their texts, the followers of Manichaeism used the 
Manichaean alphabet, named after the founder of this religion, Mani (3 rd century). 
Middle Persian and Parthian, two Iranian languages, were the languages of 
liturgy for Manichaeans. Therefore, New Persian Manichaean orthography shows 
an influence from both these languages’ orthography, as well as from Sogdian 
Manichaean orthography, to the extent that New Persian in Manichaean script 
can be considered as the last heir of the graphic traditions of pre-Islamic Iran. 

From historical sources we know that during the 10 th century Manichaeans 
were obliged to leave Iraq and western Iran taking refuge in northeastern Iran, 
and especially in Samarkand, the ancient capital of Sogdiana. This was one of 
the capital cities of the Persian Samanid dynasty, which ruled over eastern Iran 
in the last quarter of the 9 th century and during the entire 10 th century. Persian 
poetry and prose had its first important blossoming under Samanid patronage. 
Samarkand also had strong links with the Sogdian colonies of Central Asia, in the 
Turfan oasis and other places in Chinese Turkestan where, at the beginning of the 
last century, all the extant texts in Manichaean script were discovered. 

Two fragmentary poetical New Persian texts in Manichaean script were 
published by Walter Bruno Henning in 1962. Henning dates the first of these 
manuscripts, a fragment of the poem Bilawhar and Budisaf, to the first half of 
the 10 th century, during the lifetime of the first great poet of Persian literature, 
Rudakl. The second, a fragment of a monorhymic lyrical composition ( qasida ), 
could be - judging from its script - even older than the first one (Henning 1962, 
99; see also below). 
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These texts are of great interest. From the orthographic point of view they are 
a proof of the historical and conservative character of New Persian Manichaean 
orthography. Indeed, the two poetic texts become metrically readable only sup¬ 
posing - behind their conservative spelling - the new forms fixed through the 
Arabo-Persian orthography. Regarding the second text, the qasida, Henning 
made the assumption ‘that the poem had originally been written in Arabic script 
and was then transliterated into Manichaean script by a man who did not under¬ 
stand it properly’ (1962,99). In fact, an occasional mistake in the spelling of some 
words attached to one another is evidence of poor knowledge of the language by 
the copyist: <cwzg’hyy> for juzgah-i, in the expression juzgah-i sumar ‘except at 
the time of the (last) reckoning’ (1. 20). 

Henning’s brilliant hypothesis of an original Arabo-Persian version from 
which the text we possess has been copied is useful to explain the metrical form 
of the text (or of both texts), composed according to the new quantitative prosody 
deeply influenced by Arabic prosody. This hypothesis, beyond permitting 
Henning’s masterly reading of a text so full of gaps, can certainly account ‘for the 
omission of the Idafe-particle (at least four times) and the word “and”’ (Henning 
1962, 99); but cannot account for all the other places where the idafa particle 
has been correctly inserted (11. 9 after bad, 10 after pes and sarab, 20 after gah, 
21 after zer and 30 after soy). How could a copyist unfamiliar with Persian have 
inserted it correctly? Moreover, how could a copyist unfamiliar with Persian have 
used - for juz ‘except’ in the phrase above - the historical Manichaean ortho¬ 
graphy <cwz>, with <c> being the transliteration of the ancient Aramaic <s>? In 
the Arabo-Persian orthography he would have found, rather, <j> instead of <c> 
(see Table in Section 2.1 above). 

Another mistake made by the copyist, the writing <rrd’> instead of <drd’> 
dard-a ‘o grief!’ (1.1), can likewise be evidence that the antigraph from which the 
copyist was copying was written in Manichaean script. A confusion between <r> 
and <d> is possible in the Manichaean writing, where the two letters only differ 
for a point above <r> or below <d>. Instead, though in later styles of the Arabic 
script <r> and <d> could be confused, it was probably not so in early styles, where 
these two letters had a markedly different form, and <r> could rather be confused 
with final <n>. 30 


30 For <r/z>, <d/d> and <-n> in the Codex Vindobonensis, see the first line of the text (fol. lv): 
sipas bad yazdan-i dana wa tawana-ra. The confusion between <r/z> and final <n> in the earliest 
Arabic writing styles is proved by some mistakes in the quotation of Persian words by Arabic 
authors, as <gm’z> (with <z> = <r> with a point above) for Persian guman ‘doubt’, and <’ydn> for 
eSar ‘here’ (Tafazzoli 1974, 339, 343). 
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Though an original redaction of the text of the qasida in Arabic script cannot 
be excluded, we can suppose that the text we possess was copied - by an inaccu¬ 
rate copyist who was probably not completely familiar with Persian - not from a 
manuscript in Arabic script, but from one in Manichaean script already showing 
a strong influence from the new Arabo-Persian orthography. Evidence of such 
an influence must be seen - as pointed out by Henning - in the cases in which 
the idafa particle, and in the one case (1.12) in which the coordinate conjunction 
are not written. 31 An influence of the coeval Arabo-Persian orthography on the 
orthography of this text is also proved by the unusual spelling of the idafa parti¬ 
cle in the already quoted phrase juzgah-i sumar (1. 20): in <cwzg’hyy> the idafa is 
spelled <yy> instead of <'yg> or <'y>, which are the normal spellings for it in Man¬ 
ichaean orthography. Lastly, ‘the scribe of the Qaslde allows an occasional -S- for 
postvocalic -d- (‘ryS ‘he brings’, nbwSm ‘I was not’)’, instead of -d-, given through¬ 
out by the copyist of the other poetical text in Manichaean script (Henning 1962, 
90): perhaps this too is an interference from the coeval Arabo-Persian orthogra¬ 
phy. For the preposition pa(d) and the negative verbal prefix ne (or already na), 
the copyist of the qasida prefers the joint spellings <p-> and <n-> respectively, as 
in the new Arabo-Persian orthography, instead of the spellings <pd> and <ny>. 32 

As already stated, Henning considers the fragment of the qasida as even 
older than the other manuscript, the fragment of the poem Bilawhar and Budisaf 
in Manichaean script, which he had dated to the first half of the 10 th century. 
Therefore the fragment of the qasida, and even more so its antigraph, should be 
dated at an early date, possibly between the end of the 9 th and the beginning of 
the 10 th centuries CE. The text of the qasida turns out to be not only one of the 
earliest surviving poems of the kind, but an early original document attesting to 
the parallel development and fixing of the Arabo-Persian orthography. 


31 Cases of unwritten idafas are also to be found in other New Persian Manichaean texts 
(whereas they are quite rare in Middle Persian Manichaean texts): an unwritten idafa should 
probably be recognised in an otherwise incomprehensible passage in the Catechism ( Lehrtext ) 
published by Sundermann (2003, C13-14): <q’lbd k’ x’n’g (h)wm’n’g ’st ”b rwsn ny> kalbad ka 
xana humana ast ab(-i) rosan ne ‘the body, which is like a house, is not bright water’. Also in the 
New Persian Manichaean texts published by Provasi there is at least one instance of an unwritten 
idafa (2011, 149). As to the leaving out of the conjunction, we already saw that this is not a rare 
occurrence in the most ancient Arabo-Persian manuscripts. 

32 A full discussion of Henning’s hypothesis about a possible Arabo-Persian antigraph for the 
qasida is given by Orsatti (2007,161-164). 
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4 Conclusions 

On the basis of the New Persian original documents in non-Arabic scripts here 
examined, we can tentatively date and localise the beginning of the Arabo-Per- 
sian orthographic influence on the other written traditions of Iran: northeastern 
Iran, end of the 9 th - beginning of the 10 th centuries. By then, this orthographic 
tradition appears as already fixed. Though it cannot be excluded that some scat¬ 
tered and unsystematic attempts were accomplished here and there in different 
places of Iran, the hypothesis of a multi-centric origin of the adaptation of the 
Arabic alphabet to Persian, favored for example by Akimuskin (1987, 332), seems 
less probable in the light of our documentation. 
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Writing Judaeo-Arabic 


Abstract: The transcoding of Arabic in Hebrew script found in Judaeo-Arabic texts 
presents a unique case study in the investigation of standardisation processes. 
Three major orthographical phases can be discerned. A phonetic orthography was 
used to write early Judaeo-Arabic. A new Arabicised orthography emerged in the 
9 th century known as Classical Judaeo-Arabic, setting the norms used by a large 
middle class of Jewish Egyptian writers. The economic deterioration and political 
changes which began in the 12 th and 13 th centuries resulted in the breakdown of 
the Jewish middle class. Segregation between Muslims and the minorities in the 
Islamicate increased, leading to a return to religious values and a more inward¬ 
facing attitude within the Jewish communities. This is also reflected in the written 
language, which shows an increase in markedly Jewish forms, and eventually led 
to the Hebraisation of Judaeo-Arabic orthography. 

In addition, vast differences in writing standards can be observed in the various 
genres of Judaeo-Arabic from the 13 th century onwards, with a widening gap 
between literary and utilitarian writing. This indicates the absence of an ack¬ 
nowledged supra-regional standard across the Jewish communities. In turn, 
this perhaps points to the lack of a central authority, which the Babylonian and 
Palestinian Academies had been in the Geonic period, issuing norms to which 
writers felt it necessary to adhere. The disappearance of universal standards 
may have been aided by the employment of Jews in government offices and the 
accompanying education of the Jewish elite with respect to the Arabic script. With 
segregation and the lack of Jewish clerks in official government functions in the 
late Ayyubid and Mamluk periods, this particular alley of secondary education 
became unavailable, which produced a knock-on effect on scribal practice within 
the Jewish community. 


9 Open Access. © 2019 Esther-Miriam Wagner, published by De Gruyter. This work is 

licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110639063-003 
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1 Judaeo-Arabic 

In sociolinguistics, language is seen as a means of identity for speakers and 
writers. Writers of particular social groups employ particular forms of speech and 
written text, sometimes consciously, sometimes unconsciously, that mark them 
as members of a certain social entity. Religious communities are a particularly 
interesting object for sociolinguistic studies because they are among the groups 
that draw the strongest sense of communal belonging. 

The sociolinguistics of medieval Arabic and the diverse religious sociolects of 
Arabic present intriguing topics as there is a clear dichotomy in the Muslim part 
of the population between the spoken vernacular, on the one hand, and the lite¬ 
rary language, so-called Classical Arabic, based on the prescriptive norms coll¬ 
ected by the grammarians of the 9 th century. Classical Arabic probably emerged 
from a register associated with poetry and divination, and became the language 
variety in which the Qur’an and the religious literature were codified in the first 
centuries of Islam. It forms the prescriptive standard for Muslim literary writing, 
is relatively uniform and exhibits only minimum variability. Due to its link with 
the Qur’an, Classical Arabic was and is considered ‘correct’ Arabic, while the ver¬ 
nacular has, or at least used to have, little prestige in comparison. 1 

The situation in the Christian and Jewish parts of the medieval population 
was slightly different as they were not bound to the same degree as their Muslim 
counterparts to the literary ideal of al-‘arabiyya. As a result, the norms of Arabic 
in the writings of Christians and Jews varied, at certain times to a considerable 
degree, from those standards employed by their Muslim compatriots. 

Jews in particular created their own language standards as their primary 
education - focussed on enabling children to read the Bible - was conducted in 
Hebrew. 2 The Hebrew alphabet was thus part of Jewish linguistic ability as well 
as of Jewish identity, although there may have also been pressures from outside 
not to use Arabic script for Jewish community affairs. The Pact of ‘Umar 3 , for 
example, contains the promise ‘not to engrave Arabic inscriptions on our seals’. 4 


1 Admittedly, we may be at a watershed moment in time as this attitude appears to be changing 
among younger Arabic speakers, due to the use of mixed registers that include many vernacular 
forms in mobile phone and internet communication, such as on Twitter or Facebook, which is 
written in a modified Latin alphabet. 

2 See Olszowy-Schlanger 2003, and also Wagner 2018. 

3 I will not discuss the disputed historic authenticity of the Pact of Umar here, but refer to the 
article by Cohen (1999) on the topic, and to the discussion pertaining to the general dubiousness 
of historic sources from the Early Islamic period as put down in Astren 2009. 

4 See Cohen 1999,107. 
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Although this may only reflect social attitudes of the 8 th and 9 th centuries, it could 
be suggested that the principles of this text, created as a precedent of relations 
between Muslims, Christians and Jews, still shaped linguistic policies at a much 
later time. Whether it was a matter of identity, literacy or political pressure, or 
most likely a combination of all three factors, Jews mostly wrote their Arabic 
texts in the Hebrew alphabet, creating what we call Judaeo-Arabic. 5 Our earliest 
sources of written Judaeo-Arabic are a small number of papyri from the 8 th -9 th 
century, whereas the earliest extant texts on parchment and paper date from the 
10 th century. 

It was not only the written language forms that differed; speech too was spe¬ 
cific to the respective faith communities. Specific spoken Jewish sociolects may 
possibly have existed as early as the 7 th century, as reported in Islamic sources, 
but scholars such as Fred Astren have pointed out that these early accounts have 
to be read with caution and may possibly reflect much later attitudes (Astren 
2009). Khan (2007, 526) has advised similar caution, since the surviving works of 
pre-Islamic poets ‘do not exhibit anything that distinguishes them from the works 
of their non-Jewish contemporaries’. While it is therefore very likely that Arabic¬ 
speaking Jews spoke a sociolect among themselves throughout the centuries, the 
degree to which it differed from their non-Jewish neighbours must have varied 
considerably. The difference between Jewish and non-Jewish varieties of Arabic 
might have been both marginal and limited to the lexicon in pre-Islamic times, 
but this probably changed with the Islamisation of Arab society and the increa¬ 
sing association of Arabic with Islam, which led to the linguistic segregation of 
non-Muslim communities. The less Jews and Christians were part of the state, the 


5 Several definitions for the term ‘Judaeo-Arabic’ have been proposed, which appears to be part 
of a larger terminological problem when discussing any Judaeo -X language. Criteria suggested by 
a variety of scholars include: ‘written by Jews’, ‘written for Jews’, ‘using the Hebrew alphabet’, 
‘using Hebrew loanwords’ etc. Controversy is often caused by texts that are written in Hebrew 
characters but which present a simple one-to-one transcription of a text in a different alphabet, 
and as to whether these can be defined as being written in a Jewish sociolect based solely on 
the fact that the Hebrew alphabet was used. Such one-to-one transcription of Arabic works into 
Hebrew script, for example in scientific manuscripts, are not viewed as Judaeo-Arabic by some 
scholars because they are seen as mere renderings of an Arabic text in Hebrew characters. These 
texts are therefore sometimes referred to as ‘Arabic written in Hebrew characters’. Yet the rende¬ 
ring of a text in Hebrew script shows in itself that the writer and intended audience of a piece of 
texts must have been of Jewish background. If definition requires that authorship and audience 
be discussed for every piece of writing, this raises highly controversial issues. Geoffrey Khan 
(2007) has, therefore, suggested that the nomenclature of Judaeo-Arabic should be based on a 
purely descriptive criterion: the use of Hebrew script. We follow his suggestion, and call Judaeo- 
Arabic all texts that are written in Hebrew characters. 
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more their language was removed from that of Muslims. In Fatimid Egypt on the 
one hand, where the Shiite rulers created a relatively tolerant atmosphere with a 
large Jewish middle class, in a state in which members of the minorities could rise 
to influential positions in the bureaucracy, the language of the correspondence of 
Jewish merchants would betray little difference to that written by contemporary 
Muslim traders. The linguistic non-conformity of Jewish writing in comparison 
to contemporary Muslim language norms during the later Ayyubid and Mamluk 
rule, on the other hand, is conspicuous. 6 

It is thus not surprising that the best known examples of religiously marked 
speech forms are the spoken early 20 th -century Arabic dialects. Blanc (1964) 
describes how in Baghdad Muslims spoke a variety based on rural bedouin Arabic 
(the so-called gilit dialect) whereas Christian and Jews used to converse in an 
older, urban variety (of the so-called qeltu type). Initially, following the Islamic 
conquests, all three communities probably spoke the same emerging Baghdadi 
form of spoken Arabic. The variety that Muslims spoke, however, changed after 
subsequent waves of immigration from Arabia brought Arab Bedouin who settled 
in and around Baghdad. Due to the prestige that the language of Bedouin had 
among the Muslim population - Bedouin were seen as the arbiters of Arabic, pre¬ 
serving its true original character - their speech patterns spread in the Muslim 
population of Baghdad, who subsequently adopted this more rural dialect as 
their own. Christians and Jews, on the other hand, continued to speak the older 
urban dialect that had emerged following the original Muslim conquests. 

An additional factor for the development of different forms of speech and 
written language particular to their religious groups may have been the desire 
of Christian and Jewish speakers to segregate themselves linguistically from the 
Muslim population and to create their own way of speaking and writing. Perhaps 
this happened not only on account of minority-internal linguistic politics, but 
also due to pressure from the religious majority. In the Pact of ‘Umar, mentioned 
above, one of the tenets put down by the minority communities contains the 
phrase ^j ‘we shall not speak as they [Muslims] do’. 7 Although there 

is a debate as to what is really meant by this statement, and editions such as 
the 1990 version of Muhammad Ibn al-Walid al-Tartus! ‘s Siraj al-Muluk (the most 
commonly used source for the Pact of ‘Umar) by Jaafar al-Bayati omit the phrase. 


6 See Wagner 2010, 229-233; 2017. 

7 Taken from page 136 of the Siraj al-Muluk shelved as Arab.d.58 in the Bodleian Library Oxford, 
which is catalogued as the 1872 Cairo edition, but differs in page numbers from the numbering 
(pp. 229-230) given in Cohen (or any of the popular Pact of ‘Umar webpages), and also does not 
agree with any of the page numbers provided by Cohen (1999,104) for the other known editions. 
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we could interpret it as a deliberate attempt to linguistically separate Christians 
and Jews from Muslim speakers. The specific cultural environment and segrega¬ 
tion of non-Muslim communities thus led to the emergence of sociolects particu¬ 
lar to religious communities such as Jews and Christians. 


2 The Cairo Genizah 

The transcoding of Arabic in Hebrew script that we find in Judaeo-Arabic texts 
presents a unique case study for the investigation of standardisation processes. 
The most suitable sources for this sort of research are documents as they can 
usually be reliably dated and have not been subject to copying and re-editing in 
the same way that literary sources are. Documents also allow us to investigate the 
role of scribes for the standardisation of languages. 

The majority of Judaeo-Arabic documents have emerged from the Cairo 
Genizah. A genizah is a storeroom where old manuscripts are discarded and 
stored. Every synagogue has one, because Jewish customs dictate that anything 
with the name of God written on it cannot be destroyed but must be stored away, 
or buried. In medieval penmanship, almost every piece of writing would mention 
God, and so there was reason to handle them all with care. Through various his¬ 
torical circumstances, the manuscripts of the Ben Ezra synagogue in Cairo were 
never interred but for nine hundred years, Egyptian Jews deposited anything they 
wrote into a large, walled-off chamber, where they were unearthed at the end of 
the 19 th century. 

The total number of leaves and fragments from the Genizah comes to 
c.350,000, of which about two thirds are currently stored in the manuscript 
rooms of Cambridge University Library. The manuscripts date from the 7 th to the 
19 th century, and are mostly composed in Hebrew, Judaeo-Arabic, and Arabic, 
alongside other languages such as Greek, Judaeo-Persian and Yiddish. Many of 
the Genizah manuscripts are Bible fragments, pieces of religious literature such 
as Talmud or Mishnah, religious poetry and ethical treatises, but there is also 
an astonishing wealth of letters and legal documents. It has been estimated that 
about one seventh of all the Genizah fragments are documentary. 

Because of the wealth of data gained from the sources of the Genizah, we 
also have sufficient background information about the writers of the documents. 
The biographies and connections between the more prolific scribes of legal docu¬ 
ments are known as well as those of the merchants belonging to the network of 
traders exchanging business letters. Observations on the linguistic behaviour of 
these protagonists have informed the following sections. 
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3 Standardisation vs. de-standardisation 

In our present world, an innate desire of individuals to standardise language 
forms appears to be taken for granted, whereas linguistic variation is often looked 
upon with disdain and resentment. This was not always the case in the past. In 
the documents and letters of the Cairo Genizah we find that in many cases scribes 
displayed ‘an astonishing degree of inconsistency’ or ‘a predilection for variety’ 
which the scribes ‘must have regarded as a virtue’ (Goitein 1971, 236). This can 
also be seen in spelling. Even the most prolific scribes, such as the court clerk 
Hillel b. Eli who, by Goitein’s estimation, was the second most prolific Genizah 
scribe and who composed hundreds of documents and letters preserved in the 
Cairo Genizah ‘would spell the same word in two adjacent lines in two different 
ways’ (Goitein 1971, 237). 8 Modern sociolinguists, such as Kretzschmar (2009), 
have shown that, in speech, variation is a natural part of the linguistic system. In 
the variations in the writing of the Genizah scribes we may find the equivalent in 
the realm of the written language. 

It is not only the lack of desire to homogenise one’s own language, in itself 
and in comparison to others, that contributes to variation in language. Another 
factor that counteracts standardisation is the desire of scribes to give texts regi¬ 
onal flavours, or to mark them as having emerged from a particular school of 
scribes. Stenroos (2013) has shown this phenomenon for Middle English scribes 
who wilfully introduce linguistic forms into legal documents which do not 
conform to the supralocal standardised variety but which are part of a local dia¬ 
lects. They do this in order to produce ‘copies with a local identity’, perhaps as a 
means to assert their authenticity by giving documents regional rooting, or out of 
a sense of regional pride. 

Standardisation therefore cannot be seen as an unavoidable, linear process. 
Rather, there are factors that may aid de-standardisation, such as regionalisation, 
and deliberate efforts of scribes to vary the language in which they are writing, 
perhaps similar to how we in modern times paraphrase particular words in adja¬ 
cent sentences. Yet, at the same time, within a network of writers certain people 
may attempt to alter their own individual writing style. Nissim b. Halfon, an 11 th - 
century Genizah trader who left behind dozens of mercantile letters, chooses to 
write Arabic za with the Hebrew letter a for the first decades of his career, but 
in his later letters switches to the more popular X employed by the majority of 
contemporary ll th -century Jewish Egyptian writers of mercantile correspondence 


8 In my years of cataloguing Genizah documents I have seen many such examples, sometimes 
three or four different variations of the same word appear in one document by the same scribe. 
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(see Wagner 2010, 30). So within networks of writers we may indeed find efforts 
to produce a linguistic standard universal to that particular group. 


4 Early Judaeo-Arabic orthography 

The earliest works composed in Judaeo-Arabic are papyri from the 8 th /9 ,h century, 9 
which have been analysed and published by Blau and Hopkins (1987,1988). These 
extant Judaeo-Arabic papyri are composed in the Early Judaeo-Arabic phonetic 
orthography, which is based on phonetic principles and shows no influence from 
Classical Arabic orthography (Blau and Hopkins 1984, 124 and Hary 1996, 731). 
This lack of influence is not surprising yet the orthographical choices of those 
early Jewish writers cannot reasonably be compared to medieval material as we 
still do not know enough about the spelling conventions in the Muslim sources 
in the early Islamic period. 10 The Arabic language reforms of the 9 th century ana- 
chronistically superimpose an impression of Classical Arabic normative rules 
onto the earlier centuries, yet the linguistic reality was very different. Ideas on 
this topic have been expressed by various scholars in the last years, in particular 
in the collected volume on Middle Arabic edited by Jerome Lentin and Jacques 
Grand’Henry (2008), which clearly demonstrate that many phenomena normally 
attributed to Middle Arabic are in fact early Islamic writing conventions. Further 
research is therefore still urgently needed to inform our understanding of early 
Islamic written Arabic. 

In the early, phonetically based writing efforts of Judaeo-Arabic the conso¬ 
nantal correspondences between Arabic and the Hebrew alphabet are as follows: 


9 The papyri were initially thought to have been written in the 9 th century, but were subsequently 
estimated as being of earlier provenance. 

10 Early Muslim documents have been analysed and edited by Khan (1992,1993,2007), Hopkins 
(1984) and Grob (2010), but most early texts come from the middle of the second century of the 
Islamic era, i.e. the late 8 th century. 
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The letter dalet 7 thus represents Classical Arabic 3=> <z>, <d>, i <d> and - <d>. 

The difference between the various consonant qualities in Arabic is not marked 
with supralinear signs on top of the Hebrew letter, nor is it marked in the letter 
tav n representing both ci <t> and ^ <t>. Yet it is difficult to assess to what extent 
this reflects the phonetic nature of contemporary spoken Arabic as there is also 
a lack in graphic distinction between phonetically distinct j <j> and £ <g> both 
written as l. In addition, it is hardly imaginable that 3=> <z>, <d>, i <d> and - <d> 
would all have had the same place of pronunciation. As Blau and Hopkins (1987, 
133) point out ‘dalet must have polyphonic function’ in the papyri. Yet, the writers 
chose to use the same letter for all the phonetic realisations of various letters they 
perceived were best represented by dalet 7. 

The Arabic article al is spelled phonetically such as pmx for <al-rahman> 
ar-rahman. This is a main point of difference to the later emerging Classical 
Judaeo-Arabic, which only very rarely spells the article in non-morphophonema- 
tic spelling, i.e. for example pmbK, often also separated pm 7t\. Short vowels, 
in particular [i] and [u], are frequently spelled plene, whereas long a is spelled 
defectively (also a common feature of Early Islamic Arabic). 11 


11 For a more extensive in-depth analysis, see Blau and Hopkins 1987, in particular 124-151. 





Writing Judaeo-Arabic - 81 


5 Transcoding medieval Judaeo-Arabic 

The early phonetic writing appears to have largely been replaced by the 9 th and 
10 th centuries, depending on the geographic region. The change came from the 
East, associated with the transition in writing materials. The 9 th century in the 
area that covers modern Iraq was characterised by a surge in literary production. 
One of the factors that aided this increase in intellectual activity was probably the 
introduction of paper to the country. Paper as a writing material is transformative 
as it facilitates the education of a large mass of people. It is probably no coinci¬ 
dence that the 9 th -century sees the rise of the Arabic grammarians, and advent of 
the standardisation of the Arabic language. 

Thus, with the arrival of paper and the surge of literary production, efforts 
were made to standardise writing. Not only do we see this development in Arabic, 
but also in Judaeo-Arabic. The new Judaeo-Arabic norms that were being introdu¬ 
ced in the 9 th century were influenced by contemporary Arabic writing conven¬ 
tions. In contrast to the earlier Judaeo-Arabic writing efforts, the newly emerging 
Judaeo-Arabic standard was heavily Arabicised and written in an orthography 
where Classical Arabic spelling conventions were imposed on the Hebrew letters. 
Relatively little is known from which scribal schools this new standard emerged, 
but it is clear that the one book that made this style of writing popular was the 
Bible translation by Saadiah Gaon, a resident of Sura and Baghdad in what is 
modern Iraq. His Judaeo-Arabic version of the Holy Scriptures spread far beyond 
his own country, and was ‘quickly found everywhere throughout the communi¬ 
ties of the Near East, North Africa and Muslim Spain, which attest to the fact that 
it acquired an authoritative, almost canonical, status among all Arabic speaking 
Rabbanite communities’ (Vollandt 2014, 69). Saadiah’s Bible translation was not 
entirely original and probably based on earlier Bible translations, but the ‘pin¬ 
nacle in an evolutionary process in which first oral and then written Judaeo-Ara¬ 
bic translation had emerged since the early days of the Arabic conquests’ (Voll¬ 
andt 2011,11). 

Like the translations of scriptures in so many other languages, Saadiah’s 
Judaeo-Arabic version of the Holy Scriptures became linguistically extremely 
influential. Because of its popularity it started to suppress the previous, phonetic 
writing system people had used, set the new standards of writing Arabic across 
the Islamic empire and became the norm for literary Judaeo-Arabic for the next 
centuries. This normative, Arabicised spelling is called Classical Judaeo-Arabic 
by most scholars. 

The use of the Hebrew alphabet meant that particular Arabic orthographical 
traditions were easily abandoned, and it also facilitated the influence of Hebrew 
norms on written Judaeo-Arabic. On the other hand, Muslim standards were occa- 
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sionally applied as a means to alter the register of written Judaeo-Arabic texts, 
and thus, the proximity to Muslim traditions varies considerably in the different 
genres, and throughout time. 

As in the earlier phonetic alphabet, the writers of Judaeo-Arabic had to 
accommodate the fact that the Hebrew alphabet consists only of 22 graphemes, 
while Arabic possesses 28, in Judaeo-Arabic a number of Hebrew graphemes 
were each called upon to represent two (or more) Arabic graphemes in the newly 
emerged Judaeo-Arabic orthography. This concerns the graphemes i, 7, 3, x, a, 
and n. In most cases, one of the set is provided with a dot above the grapheme 
whereas the other is not. Those graphemes supplied with a dot are in many cases 
those equivalent to Classical Arabic ^ <t>, z <)>» t < h>, 2 <d>, <d> and <z> 

(h j 3 etc.), whereas Classical Arabic cj <t>, j <d>, <s>, <t>, t> < § > . and ^ <k> 

are without dot (m3 etc.). Some writers, however, indicate g with the dot and not 
j. Others point both g and/. 
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In contrast to the earlier Judaeo-Arabic orthography, it is very obvious that the 
Arabic alphabet undoubtedly served as a template. In Early Judaeo-Arabic, 
<d> 12 and -•= <z> for example, would have been written with dalet 7, which was 
perhaps closer to the actual pronunciation in the spoken dialects. However, the 
literary standards of Classical Arabic exerted their influence in Classical Judaeo- 
Arabic, and thus the letter that corresponded in form to the one used in the Arabic 
alphabet was chosen, i.e. the graphemes X or X were employed for o^. A slightly 


12 In particular in letters from Byzantium, we also find iff for s/ o“, see Outhwaite 2009, 214. 
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more complex situation is found in the spelling of Classical Arabic -h. Because 
<d> and ^ <z> had merged in the spoken language probably fairly early, or 
perhaps, more precisely, because the difference had phonetically never been 
realised at all to start with in spoken dialects outside the Arabian peninsula, i 
was used to represent both consonants. However, -h is also found spelled as 6, in 
analogy with the shape it takes in the Arabic alphabet. 

Most importantly, in contrast to the phonetic orthography of the earlier 
period, Classical Judaeo-Arabic to a large degree also observes Classical Arabic 
rules concerning the spelling of short and long vowels. Short vowels are only 
rarely spelled, at least in the early medieval period, whereas long vowels are 
usually represented in spelling, as in Classical Arabic. 

Not all medieval writers, however, follow the rules of Classical Judaeo-Ara¬ 
bic. In particular in letters written in crude hands we very often encounter spel¬ 
lings that are more reminiscent of the earlier phonetic writing conventions, see 
Blau and Hopkins (1984). Since crude letter writing is associated with a lack of 
formal scribal education, it is only natural that their writers would not be affected 
by Arabicised orthographical conventions that are clearly a result of educational 
standards of the upper classes (Goitein 1971, 346). 


6 A middle class of medieval writers 

We have an abundance of information concerning the education of medieval 
Jewish Egyptians from the Genizah sources. Children’s education in Hebrew was 
mentioned above, where all boys but also girls went to schools, with literacy 
‘exceptionally high’ (Olszowy-Schlanger 2003, 47), in particular in urban centres. 
Once more grown up, the Genizah writers had their skills honed by various chan¬ 
nels of secondary education. Goitein (1971, 183-185) has distinguished three dif¬ 
ferent types of professional scribes: first the government clerks working for the 
chanceries, who were proficient in writing Arabic script, and who received their 
training within these government institutions (Zcatib). Secondly, there are scribes 
who wrote legal documents and letters for the Jewish community and its legal 
institutions, mostly in Hebrew script (so/er). These scribes were trained within 
the Jewish scriptoria; Wagner (2018) has shown the similarity in handwriting 
between teachers and students, and the way students appear to have learned by 
copying out older documents. Arad and Wagner (2013) have also demonstrated 
how professional scribes working for community dignitaries were the final arbi¬ 
ters of linguistic style used in documents, rather than their socially and politically 
more prestigious masters. Thirdly, we have scribes who copied books ( nasih ), 
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which seems to have been a specialised branch of scribal work. Added to this are 
traders and physicians, who - out of need for their professions often proficient 
in Hebrew and Arabic scripts - developed their own writing standards slightly 
outside of the norms and habits of professional scribes, which includes different 
styles of handwriting. 13 

The effect of high educational standards for all professional classes is con¬ 
vincingly demonstrated in mercantile letter writing from the 11 th - century, which 
represents by far the largest percentage of medieval Genizah writing. 14 Religious 
tolerance and economic prosperity in ll th -century Fatimid Egypt produced a large 
middle class of Jewish traders who conducted business around the Mediterra¬ 
nean. These merchants created their own writing conventions in their extensive 
business correspondence, which is astonishingly homogeneous on a linguistic 
level and also displays a strong influence of contemporary Muslim letter writing 
norms. This is all the more astonishing as traders are usually known for their 
‘pragmatic literacy’ (Parkes 1973, 555) and lack of adherence to superimposed 
literary norms. 

From the way Arabic is used in merchants’ letters and from information stated 
by the traders in their correspondence we can infer that many were very familiar 
with the Arabic alphabet. Muslim-Jewish business partnerships were common 
under the Fatimids, making it necessary for the Jewish merchants to deal with 
Arabic mercantile documents on a daily basis. Letters often have addresses in 
both Arabic and Hebrew script, displaying how fluid many merchants were in 
both alphabets. 

Not only merchants do display their aptitude in the different alphabets. Some 
authors use different alphabets almost playfully. For example, Daniel b. Azaryah, 
an ll th -century leader of the Jewish community, employs Arabic script in letters 
to particular recipients as a way to save space at the end of line (writing the same 
expression in Arabic script is a lot more space effective than it would be in Hebrew 
script, see Fig. 1). Others, for example a writer called Judah b. Abraham, switch 
between Arabic and Hebrew script when they alternate between the Hebrew and 
Arabic languages (Fig. 2). We can thus assume schooling in both Hebrew and 
Arabic scripts for the protagonists of particular professions. 

Education in Arabic must be responsible for the linguistic behaviour of par¬ 
ticular writers, who follow particular Muslim conventions that are not ordinarily 
part of the Judaeo-Arabic repertoire. These are typically linguistically conservative 


13 For traders, see Wagner 2017. 

14 Circa 1000 letters have been edited in Gil’s monumental works In the Kingdom of Ishmael 
alone (Gil 1997). 
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Fig. 1: TS-013-J-026-002-F. © Reproduced by kind permission of the Syndics of Cambridge 
University Library. 
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forms, which can be found in otherwise linguistically quite progressive material. 
This is not unusual: Nevalainen (2013) has shown that individual Middle English 
writers may adopt some progressive forms, but shun others. A good example is 
presented by otiose ’alif- the silent ’alif at the end of particular verbal forms such 
as 3 rd person plural perfect. This otiose ’alif features in Saadiah’s Bible transla¬ 
tion and following that becomes part of Judaeo-Arabic literary texts emulating 
Saadiah’s linguistic standards, yet it is never actually a full part of the repertoire 
in epistolary writing. In a corpus of more than a hundred medieval Judaeo-Arabic 
letters, otiose ’alif only features in the correspondence of three writers: in a circle 
of educated writers surrounding the 13 ,h -century judge Elijah b. Zechariah. Thus, 
within Judaeo-Arabic letter writing, otiose ’alif was used exclusively by small net¬ 
works of people, perhaps as a means to demonstrate a certain standard of educa¬ 
tion. The prestige of a Classical Arabic form that is very markedly a part of a high 
register thus also appears to have held appeal in Jewish writing networks. 

Prestige and precedent generally play an important role in the letters. Once 
a form has been brought to paper, it is then often used by other authors. Writers 
with high prestige will find the forms they employ being used by those of lower 
social standing. A good example demonstrating the importance of prestige may 
be found in the handwriting of scribes and individuals of the late Fatimid and 
early Ayyubid period. From the 12 th -century onwards, the handwriting of Egyp¬ 
tian Jews became considerably more cursive compared to the earlier, squarer way 
of writing Hebrew script. It has been suggested that this is due to the influx of 
Spanish Jews into Egypt. In Spain, through the closer contact with Arabic culture, 
the way of writing Hebrew had been permeated by the cursivity of the Arabic 
script. Due to the high prestige of Spanish Jews (among them the most famous 
personality of the Genizah, Moses Maimonides), this trend of cursivity also 
caught on among the Egyptian Jewry. 


7 Changes from the 13 th century onward: 
the ‘Hebraisation’ of Judaeo-Arabic writing 

The homogeny and orientation along the lines of Classical Arabic found in 11 th - 
century mercantile writing contrasts sharply with letter writing in the Ayyubid 
empire in the 13 th century. During that time, we see the economic worsening 
already experienced in the 12 th century coming into full effect, which results in 
the breakdown of the Jewish middle class. Segregation between Muslims and the 
minorities in their states increased, which led to a return to religious values and 
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a more inward-facing attitude within the Jewish communities. 15 This can also be 
observed in writing: correspondence of the 13 th century begins to exhibit a much 
stronger influence of Hebrew norms, counteracting the Arabicisation imposed in 
the earlier centuries. These differences can be correlated to the advancing lack 
of integration of Jews within Egyptian society. In the Jewish communities, incre¬ 
asingly more isolated from other faith communities, the Hebrew influence on 
language became much stronger and where we used to find Arabic formulae in 
the introduction and blessings of letters, they have now been largely replaced by 
their Hebrew counterparts. 

On an orthographic level, Hebrew influence is most noticeable in the incre¬ 
ase of plene spelling of vowels, in the spelling of reduplicated [w] and [y] and 
in the way particular morphemes are written. For example, the early medieval 
sources usually follow Classical Arabic conventions and do not mark the redup¬ 
lication graphically within the rasm (graphic line) of the word (although we also 
find sadda used above the Hebrew letters by certain writers). In the spelling of 
the 3 rd person singular masculine suffix pronoun, the spelling V <-w> that denotes 
the suffix in Hebrew occurs as frequently as (and in particular texts even more 
frequently than) the Classical Judaeo-Arabic spelling n- <-h>. It is worth noting 
that the Hebrew spelling conventions that are applied in the latter example are 
closer to the actual pronunciation of the suffix [-u] in the spoken Egyptian Arabic 
than the Standard Arabic orthography of [-h]. These phenomena can already be 
observed in material from the 10 th -12 th centuries but the increase in frequency 
over the century from the 13 th century onwards is very noticeable (see the table in 
Wagner 2010, 40). 

In addition, vast differences in writing standards can be observed in the 
various genres of texts from the 13 th -century onwards and in particular in the Late 
Judaeo-Arabic of the Ottoman period. 16 The norms start to be mostly dependent 
on the type of text, in contrast to the period of the 10 th -12 th centuries during which 
Judaeo-Arabic across the genres was largely standardised. For example, the gap 
between utilitarian prose (in particular letters) and literary works is widening; 
letters are often written in a much more colloquial language than literary texts, 
which still follow Classical Judaeo-Arabic standards to a degree. At the same time, 
there is also enormous variation of forms when different writers are compared to 


15 Goitein 1978,161-162. For the changing attitudes towards minorities, also see Leiser 1976, 
in particular 68-88. 

16 This article follows Khan’s (2007, 526) periodisation, who proposes three major phases: Early 
(9 th century), Classical (10 th —15 th centuries) and Late Judaeo-Arabic (from 15 th century onwards). 
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one another, even though the Genizah contains much fewer sources from the 13 th - 
century onwards as compared to material from the 10 ,h -12 th centuries. 


8 Conclusions 

To sum up: Language provides identity for its speakers and writers, and members 
of the different confessional groups employ particular forms of speech and 
written text that mark them as members of a certain social entity. Jews in particu¬ 
lar created their own language standards as they, because of education, religious 
identity and perhaps political pressure, wrote their Arabic texts in the Hebrew 
alphabet, creating what we call Judaeo-Arabic in the process. 

The transcoding of Arabic in Hebrew script found in Judaeo-Arabic texts pre¬ 
sents a unique case study for the investigation of standardisation processes, and 
the most suitable corpus can be found in the documents of the Cairo Genizah. 
Standardised and varying forms used by the scribes of the Genizah demonstrate 
that standardisation cannot be seen as an unavoidable, linear process. Various 
factors may lead to de-standardisation, whereas network-internal linguistic stan¬ 
dardisation may prove to be the driving force behind general standardisation 
processes. The main factor controlling the assertiveness of linguistic forms is 
perhaps the social prestige of those protagonists setting precedent. 

In Judaeo-Arabic, three major orthographical phases can be discerned. In the 
early Islamic centuries, a phonetic orthography is used to write early Judaeo-Ara¬ 
bic. Affected by the standardisation efforts of Classical Arabic in the 9 th century, 
a new Arabicised orthography emerged for Judaeo-Arabic too, aided by the spread 
of Saadiah’s Bible translation; this initiated the period of so-called Classical 
Judaeo-Arabic. Classical Judaeo-Arabic was not only a literary standard but can 
be found in utilitarian prose, too: the efficiency of high educational standards 
in the relatively tolerant and prosperous ll th -century Fatimid Egypt produced a 
large middle class of Jewish traders who produced their own writing conventions 
in their extensive mercantile correspondence, displaying a strong influence of 
contemporary supra-communal letter writing norms. 

The economic deterioration which began in the 12 th and 13 th centuries resul¬ 
ted in the breakdown of the Jewish middle class, and eventually led to the Heb- 
raisation of Judaeo-Arabic orthography. Segregation between Muslims and the 
minorities in the Islamicate increased, leading to a return to religious values and 
a more inward-facing attitude within the Jewish communities. This can also be 
observed in writing: correspondence of the 13 th century begins to exhibit a much 
stronger influence of Hebrew norms. 
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In addition, vast differences in writing standards can be observed in the 
various genres of Judaeo-Arabic from the 13 th century onwards, with a widening 
gap between literary and utilitarian writing. This could be interpreted as indi¬ 
cating that, in the absence of an acknowledged supraregional standard across 
the Jewish communities, local networks, scribal schools and gathering of writers 
developed their own norms, which they then followed. In turn, this perhaps 
points to the lack of a central authority, such as the Babylonian and Palestinian 
Academies had held in the Geonic period, issuing norms to which writers felt 
the urge to adhere. In comparison, during the Classical Judaeo-Arabic period, the 
writing standards used in the various literary and utilitarian genres were rela¬ 
tively homogeneous, and fairly close to those norms used by Muslim contem¬ 
poraries. This may have also been aided by the employment of Jews in govern¬ 
ment offices, and accompanying Arabic script education of the Jewish elite. With 
segregation and the lack of Jewish clerks in official government functions in the 
late Ayyubid and Mamluk periods, this particular alley of secondary education 
became unavailable, which must have had a knock-on effect on scribal practice 
within the Jewish community. 

The phenomenon of linguistic de-centralisation from the 13 th century onwards 
is by no means restricted to the Jewish community; comparable processes can be 
observed in Muslim Arabic writing, too, as can be best seen in Muslim Arabic 
from the 16 th century onwards. 17 Social, political and economic circumstances 
thus exerted a similar influence on all religious communities, regardless of their 
linguistic differences. 
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Cross Palaeographic Traditions. Some 
Examples from Old Christian Arabic Sources 

Abstract: This chapter deals with early Palestinian Arabic manuscripts from South 
Palestinian monastic collections now kept in the library of St Catherine’s Monas¬ 
tery, Egypt. The aim of this study is to demonstrate that in the early Arabisation 
process of the Melkite Palestinian Church (8 th -9 th century), it is possible to find 
palaeographic, linguistic and layout features testifying to inter-faith interaction. 
Accordingly, the text of the holy book of Islam might have played an important role 
for the Arabised Melkite communities of Palestine. The early activity of translating 
the Bible and the Patristic and ascetic heritage into Arabic proved to be an impor¬ 
tant stage in the acquisition of the Arabic writing technique by Melkite monks 
living in the Caliphate. By comparing Islamic and Christian sources, I try to cross 
the all too narrow confessional boundaries in which ‘Christian Arabic studies’ 
have been confined for the last two centuries. 


1 Historical introduction 

The Melkite Church was the first eastern Christian church living in the Arab world 
that adopted Arabic as its liturgical language. 1 After the Arabic conquests, the 
Patriarchate of Jerusalem became part of the caliphate’s territory; subsequently 
within the increasing 'Arabic-speaking Melkite community, Jerusalem and its 
monasteries effectively became the centre of a doctrinal development’ (Griffith 
2006,185). Actually, even though Greek was the language that symbolically pre¬ 
served the links with Byzantine orthodoxy beyond the borders of the caliphate, 
Palestinian monks had carried on the enterprise of translating the Bible into local 
tongues since pre-Islamic times (Griffith 1997; Briquel-Chatonnet and Le Moigne 
2008). This means that Palestinian monasteries such as St Saba and St Kariton 
in the Judean desert had always preserved their local identity against the Greek 
culture of Constantinople. Accordingly, the Arabisation of the church of Jerusalem 
after the rise of Islam had a double function: to build an Arab Orthodox identity. 


1 For a short history of the Arabic-speaking Melkite Orthodox Church see Griffith 2006, who 
clarifies why from the 8 th century onward this church decided to translate its religious heritage 
from Greek into Arabic. 


3 Open Access. © 2019 Paolo La Spisa, published by De Gruytei. CDThis work is licensed 
under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110639063-004 



94 


Paolo La Spisa 


sociologically and culturally, albeit not doctrinally, distinguishable from their 
Greek Orthodox co-religionists on the one hand, and to be able to produce an 
apologetic literature in Arabic to cope with the new religious challenge of Islam, 
on the other. These are the reasons why, within the Melkite Jerusalem Patriarch¬ 
ate, South Palestinian monasteries were the cradle where Christian Arabic liter¬ 
ature had its origins. The manuscripts that were once in their libraries, are now 
collected and preserved in St Catherine’s Monastery in the Sinai Peninsula (Samir 
1990, 1990-1991; Lafontaine-Dosogne 1996; Gehin 1998; Mouton 2000, 105-124; 
La Spisa 2008; 2012, 210-213). 

In what follows I will examine some palaeographic features in manuscripts 
belonging to the St Catherine Library and directly originating from the South 
Palestinian monastic milieu. After having very briefly outlined the cultural 
framework in which they were produced, I will try to show to what extent the 
standardisation of the orthography, script and layout has been influenced by the 
orthography and language of the Qur’an even in the Christian Arabic manuscripts 
of the first millennium coming from the Arabic Melkite Church. Subsequently I 
will try to assess to what extent it is possible to speak of Christian Arabic features 
within a Muslim religious and cultural environment. 


2 Qur’anic orthography and early Arabic 
manuscript tradition 

The early Arabisation of the Greek Orthodox Church in Palestine led the monks 
to carry out the very first translations of the Bible into Arabic, so we have evi¬ 
dence of Palestinian Arabic translations dating back to the second half of the 8 th 
century. 2 In this context it is legitimate to pose the following question; what kind 


2 Opinions about the existence of a pre-Islamic Arabic translation of the Bible diverge: Baum- 
stark (1929-1931) and Shahid (1995-2009) assumed that even though no material evidence is 
available, it is reasonable to think that such work had been accomplished at least during 
Muhammad’s lifetime. The issue of the existence of such translations received attention once 
again thanks to a recent study by Sidney Griffith, who concludes (2013, 41-42): ‘no conclusive 
documentary or clear textual evidence of a pre-Islamic, written Bible in Arabic translation has 
yet come to light’. Nevertheless, the several Qur’anic references to biblical, hagiographical and 
homiletic literary traditions are undeniable. Griffith explains this phenomenon by stating that in 
pre-Islamic time and during Muhammad’s lifetime there was an oral transmission of the Jewish 
and Christian scriptural and homiletic traditions which were directly and spontaneously trans¬ 
lated into Arabic for an Arabic-speaking audience. However, this does not exclude the existence 
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of Arabic did the scribes adopt for their translations, since the Arabic language 
before the 9 ,h -10 th centuries had not yet been normalised by the Iraqi philologists 
of Basra and Kufa (Fleisch 1990,1-15, Ferrando 2001,117-133)? The first evidence 
of Arabic manuscripts surviving up to the present comes from a few copies of the 
Qur’an reportedly dating back to the 7 th century (Deroche 2004,16). So is it legit¬ 
imate to suppose that the holy book of Islam had influenced even the Arabised 
Christian copyists? 

If we have a look at some Qur’anic Suras, it is possible to single out some 
linguistic and palaeographic phenomena that western scholars have described as 
Middle/Mixed Arabic features (Lentin 1997,2008,2012). In the Surat al-nahl (Q.16: 
72) we read: wa-bi-ni‘mati-llahi hum yakfurun ‘do they repudiate the divine grace?’ 
where the word nVmati is written with a ta’ mabsiita instead of ta’ marbuta, the 
same phenomenon can be found in medieval Christian Arabic texts (Blau 1966, 
115-116). However, in the Surat al-shu'ara’ (Q.26: 22) one can find the same word 
written with ta’ marbuta: wa-tilka nVmat-un ‘is it a favour...?’. Both orthographic 
variants are well attested. One can suppose that the ta’ mabsiita is used only in 
annexations, but in the Surat al-diihd (93: 11) we can read: wa-’amma bi-nVmati 
Rabbika fa-haddit! ‘but as for the favour of your Lord, report [it]!’, where the same 
word in annexation is written with ta’ marbuta. 

The second example is taken from the siirat al-’isra’{ 17: 1) where we read: 
subhana lladi ’asra hi-'ahdihi layl-an mina-l-masgidi-l-harami ’ila-l-masgidi-l- 
’aqsa ‘Exalted is He who took His servant by night from the Sacred Mosque to the 
Farthest Mosque’. The last word al- ’aqsa is an elative form of the adjective qasiyy 
‘faraway’, which literally means ‘the farthest’; however instead of alifmaqsura at 
the end of the word as found in current Arabic orthography, there is an aliftawila 
(cfr. Blau 1966, 81-82). 

Finally, as far as syntax is concerned, in the Surat al-ma ’ida (Q. 5:69) we read: 
’inna lladina ’amanu wa-lladina hadu wa-l-sabi’una wa-l-nasdra ‘Indeed, those 
who have believed and those who are Jews or Sabaeans or Christians’. Following 
the Classical and Modern Standard Arabic grammatical rule, one should expect 
to find the name ‘Sabaean’ to be in the oblique case since it is governed by ’inna 


of Arabic written notes by Christian literate monks and priests as aides de memoire, as Schoeler 
2002 has suggested. About the early Arabic translations of the Gospel see Guidi 1888, Arbache 
2007, Griffith 1985, 2008, 2013, Schulthess 2018. The earliest Arabic Gospel has been recently 
identified by Kachouh (2012) in the Vatican Arabic 13 which was copied in the Judean desert 
monastery of St Saba around the year 800 CE. The earliest New Testament Arabic version known 
so far is Sinai Arabic 154, whose second section contains the earliest Christian Arabic apologetic 
treatise, of 788 CE. Samir 1994; Swanson 1993; La Spisa 2014. 
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(cfr. Blau 1967, 326). 3 4 * * Another example could be taken from the surat al-nisa’ (Q. 
4: 162) where one can read: wa-l-mu’minuna yu’minuna bi-ma ’unzila ’ilayka wa 
ma ’unzila min qablika wa-l-muqimina al-salata wa-1-mu'tuna al-zakata... ‘But the 
believers believe in what has been sent down to you and what was sent down 
before you, and those who perform the prayer and give alms According to 
the Standard Arabic rules as well as to the context and the meaning of this verse, 
one should expect to find wa-l-muqimuna, in the nominative case of the regular 
masculine plural ( al-marfu c bi-l-waw wa-l-niin ) as it is the case of the other nouns 
of the verse which have the same syntactical function ( wa-l-mu’minuna , wa-l- 
mu’tuna ). As we shall see from the following examples, all these variant forms 
are also frequent in written Middle Arabic of the pre-modern era. 

Concerning the orthographic issue of the ta’ marbuta, also in Christian Arabic 
texts, ta ’ mabsiita instead of ta ’ marbuta and also vice versa, is found: j-jj 

‘by the strength of the Holy Spirit’; £ y~ ^4*- ‘the life of Jesus’ (cfr. Blau 1966,115). 

Alifmaqsura instead of aliftawila: 4j-4 lit y£A\j ‘when the woman 

who lost her son is sad, she dresses black clothes’; as is well known, according to 
the standard orthographic rules, the feminine form of is with alifmaqsura. 



Fig. 1: Dayr al-Muhallis 1807 (1643-44 CE) - Gun (Lebanon), fol. 387v l. 7. 

In the 10 th —11 th centuries Melkite bishop of Gaza Sulayman al-Gazzi’s treatise on 
the holy Cross, one can read the following incipit : i lamu ayyuha l-mutaqallidina 
namus salib al-Masih... ‘Know, you who abide by the Law of the Cross of Christ...’ 
(La Spisa 2013,1) where one should have expected to read ayyuha l-mutaqallidunaf 


3 This verse should be compared with two others, which are very similar to each other: that of 
the Surat al-baqara (Q. 2, 62): ‘inna-lladma ’amanu wa-lladma hadu wa-nasara wa-sabi’ina, and 
that of the Surat al-hagg (Q. 22,17):' inna-lladma 'amanu wa-lladma hadu wa-sdbi’ina wa-nasara, 
where the word sabi'ma is written according to the rule of 'inna. For further details on these ver¬ 
ses and their interpretation, see Burton 1988,188-196 and Abdel Haleem 1992, 425-427. 

4 All grammars of Classical and Standard Arabic agree by stating that ‘44 and 44 4 require after 

them a noun, singular, dual or plural, defined by the article, and in the nominative case.’ (Wright 
1962, 92-93, emphasis mine), see also Veccia Vaglieri (1937 [1996 7 ], 1,135; II, 173). For having an 

idea on the discussions about the nida’ among Arab grammarians, see al-’Anbari (1997,128). For 

further examples in addition to the essential work in the field of Middle Arabic by Joshua Blau 
1966-1967, 2002, see also Hopkins 1984; Lentin 1997, 2008, 2012; Grand’Henry 2006, which are 
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In 8 th -9 th century Arabic manuscripts nowadays kept in St Catherine’s 
Monastery, it is possible to find all these aforementioned orthographic and lingu¬ 
istic features that lead us to think that, at the very beginning of Arabisation, the 
written language was the same for all religious communities (den Heijer 2012). 
This statement can be demonstrated by comparing sources dating back to the 
same period but emerging from different confessional and cultural backgrounds. 
The same conclusions could be formulated also for Arabic palaeography. In what 
follows I demonstrate that sources belonging to different religious milieus actu¬ 
ally share the same palaeographic features. 

In the first Abbasid era, the most widespread kind of Arabic script was the 
so-called kiifi, or, as Deroche (1987-1989,353-354) has labelled it, ecritures abbas- 
sides anciennes ‘early Abbasid scripts’ (Gacek 2009, 97-98), whose most relevant 
peculiarities are: 

1. the isolated or final alif with a more or less developed extension below the 
line, 

2. dal with two parallel and horizontal rods, 

3. the median ‘ ayn whose head is constituted by two antennas, 

4. final mim with a horizontal tail. 5 

In Christian Arabic manuscripts of St Catherine dating back to the same period, 
it is possible to find many examples of codices written in what scholars have 
called Sinaitic kiifi or Sinaitic-Palestinian kiifi. Some scholars have supposed 
that this kind of script was originated or directly influenced by the Syriac estran- 
gelo script that was also used in Palestinian monastic scriptoria. 6 Nevertheless, 
by comparing different sources as Deroche has done, a great similarity between 
the ‘Islamic’ and ‘Christian’ variants of the so-called early Abbasid script comes 
to the fore. I will examine in detail some orthographic features in order to show 
this similarity. 


only a few examples selected from the extended literature which developed in these last decades. 

5 See also Deroche 2000, 234, Deroche and Sagaria Rossi 2012,164-167. 

6 Gacek (2009, 1) supposes that ‘the origin of these scripts [Abbasid bookhand] are most likely 
traceable to the first century of Islam and some of them appear to have been influenced by the Sy¬ 
riac serta script’. Many scholars tackled the issue of the origin of the Arabic script in the last cen¬ 
tury. Two hypotheses have been formulated: the first one identifies in the Nabatean inscriptions 
the origin of the Arabic letters (Cantineau 1930-32; Abbott 1939; Gruendler 2006); the second 
hypothesis says that the early Kufi scripts are derived from the estrangelo Syriac script (de Sacy 
1810; Starcky 1966; Troupeau 1991; Briquel-Chatonnet 1997; Noja Noseda 2006), without making 
any distinction between Christian or Islamic Arabic sources. 
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3 The case of the qaf and fa 3 diacritical points 

Father Khalil Samir (1991) described for the first time some palaeographic fea¬ 
tures of early Christian Arabic apology as attested in the Sinai Arabic 154 J The two 
most important phenomena that struck him were the way of writing qaf and fa’ 
and the presence of split words, elsewhere unusual in Arabic. 

Fig. 2: Sin. Ar. 154 (fol. lOlr ll. 1 and 7). 


Samir remarked that, in this 8 th century parchment codex, the qaf is always written 
with a dot below the line, while the fa’ is written in the regular way (a dot above 
the letter). Samir (1994, 60) concluded that: ‘The way the qaf is written seems 
to be absolutely unique in the Arabic script’. However, Monferrer Sala (2010, 
197) carried out a little inquiry about this palaeographic feature within the same 
manuscript and pointed out the same phenomenon in at least two other Sinai 
codices: the parchment Sinai Arabic NF perg. 17 belonging to the new finds of 
the St Catherine Monastery (Meimaris 1985, 27 [Greek] and 25 [Arabic]), and Sinai 
Arabic 1 which is a translation into Arabic of some books of the Old Testament. 
Both codices date back to the 9 th century. Within the same St Catherine manu¬ 
script collection, we can also add as an example the Sinai Arabic 36, which is a 
bilingual (Greek-Arabic) Psalter copied in the 8 th -9 ,h centuries having the same 
palaeographic features. Unfortunately reproductions of this precious codex are 
not available. Thanks to the specimen of fol. lOr published in Lafontaine-Dosogne 
(1996, 110), I could identify the same way of writing qaf and fa’. Monferrer Sala 
(2010, 197) concluded that this way of marking the qaf is a ‘feature characteris¬ 
tic of early South Palestinian texts’. 7 8 Nonetheless it is noteworthy to remark that 
the same feature has also been found in Islamic sources and documents. Nabia 
Abbott (1967) has published some Islamic papyri dealing with Islamic traditional 


7 For the edition of the Apology see Gibson (1899), with an English translation entitled ‘Treatise 
of the Triune Nature of God’. On this Apology and its historical and religious context, see in 
particular Griffith 1985. 

8 The conclusions which Monferrer Sala reached could induce one to think that this feature be¬ 
longs exclusively to the Christian Arabic writing tradition (cfr. D’Ottone 2015, 271). Actually, this 
phenomenon seems to be cross-confessional. 
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literature nowadays kept at the Oriental Institute of the University of Chicago. The 
papyrus nr. 17.630, dating back to the 9 th century, is a fragment of Islamic tradi¬ 
tions ( hadlt ). Abbott (1967, 208) underlines the following palaeographic feature: 
‘Once each, fa and qaf have a dot above and below respectively’. Other non-Chris¬ 
tian examples can be found in several other manuscripts kept in the National 
Library of Paris, in Istanbul and in Saint-Petersburg (Deroche and Sagaria Rossi 
2012,181). 

It is interesting to remind that the qaf with the subscribed point can be also 
compared with the so-called maghribl way of writing fa ’ (with a point below) and 
qaf (with a point above). If we consider the history of the maghribl script and its 
origins, it is not astonishing to find several examples of the same typology even 
in eastern manuscripts, belonging both to Christian and Muslim traditions. It is 
noteworthy to mention an example of this script in what is considered a very old 
translation of the Gospel into Arabic, now kept at the Vatican Library: the Vatica- 
nus Borgianus 95. Despite the relevance of this witness, an in-depth codicological 
description of the manuscript is still lacking. 

This codex 9 is a parchment dating back to the 8 th or the beginning of the 9 th 
century. At the end of the 19 th century Guidi (1888,10) argued that it had probably 
been copied in the St Saba Monastery, in the Judaean Desert of Palestine. Belon¬ 
ging originally to the ‘Collegio de Propaganda Fide’, it was at a later stage part 
of the Borgian Museum taking the catalogue number K. II. 31 before having the 
present number 95. It is folded in-quarto, its dimensions are about 215 x 160 mm., 
the written area varies between 170 x 125 and 190 x 135 mm. with about 16-17 lines 
per page. Nowadays the codex contains 171 folia. I found 23 quires, all of which 
are quaternions. 

Regarding the palaeographic features, it is possible to say that the codex pre¬ 
sents a maghribl Took’. Its script is an early Palestinian naskhi: 10 the fa’ is written 
with a point below and qaf with a point above. As an example, see fol. 16v. line 
4: wa-yasfu, line 8: Ya'qub, line 13: qayilin / qa’ilinf 1 but the qaf is also written 
with two points above. The final alifxs marked with a rod below the writing line. 
If one compared this witness with the western copies of the Qur’an, it should 
not be difficult to recognise many strong similarities with the script of the Arabic 


9 Because of the antiquity of this manuscript, many scholars have discussed it: see as an ex¬ 
ample Guidi 1876-1877, 1888; Tisserant 1914, 55; 1924; Graf 1944, 142, 148; Metzger 1977, 262-263; 
Griffith 1985,154-155; Orsatti 1996,153. 

10 Another example of the same kind of naskhi is in the British Museum Or. 5019 (10 th cent.). 

11 For a sample of this very folio see Tisserant 1914, 55. 
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Gospel of the Borgianus 95. 12 Accordingly, as Deroche has clearly illustrated, it is 
possible to suppose that the script called maghribi today actually had an oriental 
origin. Afterwards the Maghreb preserved it with some minor regional changes. 13 

By way of some final considerations about this question, it should be 
mentioned that the diacritical points are randomly used in most of the quoted 
manuscripts. However, as regards the case of the qaf, it is possible to find it 
without points, with two points above and with a point below in the same docu¬ 
ment, if not in the same folio, as it is the case for the Sinai Arabic 1. As Monferrer 
Sala has rightly pointed out, in this manuscript the verb qala is regularly written 
with a subscribed point, however in fol. lr one can find the following words where 
the qaf is written with two points above: fol. lr line 4: halaqa ‘he created’; fol. lv 
line -4: al-sarraq ‘the thief’; fol. 2r line -1: fawqa ‘above’. The same alternation can 
be found in the Borgian 95. 

From what precedes it is possible to argue for the following hypothesis. 
Between the 7 th and 9 th centuries, the standardisation of diacritical points was not 
yet established. This explains why in manuscripts dating back to this period one 
can find at least four different ways of writing the letter qaf which alternate quite 
frequently: 1) without points, 2) with one point above (the so-called maghribi 
variant), 3) with one point below, 4) with two points above (which became the 
standard form). This alternation and fluctuation can exist even within the same 
document. 14 As Deroche (2004,73) pointed out, in this very period there were con¬ 
stant movements of scribes between East and West. This may explain the eastern 
origin of the graphic variant to write the qaf which afterwards became characte¬ 
ristic of the maghribi script. On the other hand, in the East the standardisation of 
the language by Iraqi philologists stabilised the spelling of the qaf with two dots 
above, causing the disappearance of the other ways of writing this letter. 15 If this 
hypothesis is right, it is noteworthy that, within the Arabic written tradition up to 
the first millennium, there is no confessional difference and distinction. 


12 See an example in Deroche 2004, 49. As for a Christian Arabic manuscript coming from 
the West and having the very same palaeographic peculiarities, see the bilingual (Greek-Arabic) 
parchment, Bibliotheque nationale de France, Suppl. Grec 911 (Gehin 1998,166,171). 

13 Sijpesteijn (2008, 515a) came to the same conclusions based on papyrus documents dating 
back to 7 ,h -8 ,h centuries. 

14 Although one cannot exclude a priori the possibility of the intervention of a second or later 
hand in order to explain this alternation — as Monferrer Sala supposed (2010,197) — one could 
wonder why the later hand would not systematically intervene in every qaf. 

15 It is also noteworthy to remark that, like in linguistics and textual criticism, the study of the 
early Arabic written tradition shows that peripheries are more conservative, as for the so-called 
maghribi script. 
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4 Early developments of the layout 

As mentioned above, the presence of split words at the end of lines is another 
peculiar feature of ‘early Abbasid script’. This feature seems to be characteristic 
only of codices dating back to the second half of the 8 th century up to the begin¬ 
ning of the 9 th . One can suppose that this special layout is due to the typical ten¬ 
dency of the early Abbasid era to fill the entire written area. In Sinai Arabic 154, 
whenever the text does not fill all the available space, it is possible to distinguish 
a stroke at the end of some lines: for instance in the following fols: 99v line 10; 
109v -1; llOr line 5; llOv lines 3 and 8. 



Fig. 3: Sinai Arabic 154 (end of 8 th c.), fols llOr line 5; llOv lines 3 and 8. 


I think the copyist might have used this technique only when he was unable to 
stretch the last letter of the line (the so-called masq technique), which however is 
widely used in the whole manuscript (Deroche 2000,187; Gacek 2001, 135). The 
lack of space between words is another consequence of the tendency to fill all the 
available written area. In the case of the Sinai Arabic 154 fol. 17v line 19 one finds 
a critical point which caused some problems of interpretation to philologists and 
editors. Samir rightly supposed that the words wa-fakka riqabana ‘and he untied 
our napes’ were connected to each other due to an error of the copyist (Samir 
1990-1991, 88-89; La Spisa 2014, 37). 16 

Vaticanus Borgianus 95 also shows several cases of words split at the end of 
lines. See for instance the following examples in fol. 16v. lines 2-3: tala—midahu 
‘his disciples’; lines 4-5: istir—ha ‘weakness’; lines 10-11: wa-’a—marahum ‘and 
he ordered them’. 

This very feature is widespread also in Islamic documents dating back to the 
same period; this confirms what we have already shown in the examples quoted 
above. 17 


16 It may be not by chance that this error occurred with a word whose first letter does not attach 
on its left as ra‘; see Deroche and Sagaria Rossi 2012,193. 

17 See for instance the following documents: Oriental Institute of Chicago n. 14046,17629,17631, 
17636, 17637, Vienna, Nationalbibliothek. Papyrus Erzherzog Rainer n. 734 (Abbott 1957, 32, 57, 
80; 1967,199, 207, 235). 
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Another point that should be highlighted is that when the script changes, as 
for instance in the passage from the ‘early Abbasid script’ to the longer-lasting 
naskhi script, the whole structure of the page changes consequently too. This 
change is probably strictly related to that of the material support of manuscripts. 
For instance, with the spread of paper in the Arab world and the progressive disap¬ 
pearance of parchment, layout techniques too were refined and improved. While 
we do not have any clear indication of the technique used for the justification of 
text before the introduction of paper, ‘the progressive introduction of the mistara 
led to a relative standardization of ruling types’ (Deroche and Sagaria Rossi 2012, 
123, Sagaria Rossi 2015,102). The upside-down trapezoid or triangle form of the 
colophons at the end of the epistles, treatises and prose works in general (Deroche 
and Sagaria Rossi 2012, 207) denotes not only a greater availability of paper from 
the economic point of view, but also the development of the art of mise en page. 

The technique of the trapezoid/triangle form at the end of the text (used not 
only for colophons) can be found in Christian Arabic texts too, as for example in 
the case of the Karsuni-Arabic Vatican Syrian 202 (17 th cent.) and the Swayr 323 
(123) (18 th cent.). So it is not difficult to conclude that only after the introduction of 
the paper in the Arab world did Muslim and Christian scribes alike feel that they 
could organise the layout of the page more freely. 



Fig. 4: Mar Yuhanna al-Sabig Monastery (Lebanon) - Swayr 323 (18 th c.), fol. 87r. 
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5 A confessional Middle Arabic feature? 

Islamic and Christian manuscripts share all the linguistic and palaeographic fea¬ 
tures mentioned so far. I now would like to deal with an orthographic peculiarity 
which I have found so far only in manuscripts belonging to the Christian Melkite 
milieu: the way of writing the prepositional group i> ( min ’agli ) ‘for the sake 
of’ in Arabic, with disappearance of the consonant hamza (glottal stop) and the 
resulting coalescence between preposition and name (J?^ minagli). Joshua Blau 
was the first who pointed out this feature in his Grammar of Christian Arabic 
(1966). 18 Blau described this phenomenon as an elision of the hamza (the glottal 
stop consonant) when it ‘occurs at the beginning of a word governed by a prepo¬ 
sition’ (Blau 1966,101-102). Another example is the case of min ’ayna which often 
becomes minen. However, while this last change is also frequent in Modern Arabic 
dialects, the use of minagli such as attested in eastern Middle Arabic texts, might 
implies a hybrid register between min ’agli and monsan (‘for, to, in order that’) 
which is extensively used in modern Syrian dialect (Barthelemy 1936,374; Cowell 
1964, 491). Since the omission of the hamza is a typical Middle Arabic feature, we 
are facing here a classical example of mixed Arabic between fusha and ‘ammi- 
yya. 19 The same orthographic phenomenon occurs also in two other manuscripts 
belonging to the same Melkite monastic milieu: the dayr al-Muhallis 1807 (Gun 
- Lebanon) and the Balamand 135 (Tripoli - Lebanon) both coming from ancient 
Arabic-speaking Greek Catholic and Greek Orthodox monasteries respectively, 
of the 17 th century (La Spisa 2012, 213). In conclusion it is noteworthy to remark 
that an orthographic phenomenon such as minagli for min ’agli, which is typical 
for manuscripts coming from southern Palestinian monasteries, is shared also 
by texts written within the wide area that goes under the name of Bilad al-Sam. 20 


18 See also Blau 2002, 35 §26 and La Spisa 2012, 213. 

19 In order to clarify this point it would be useful to remember that ‘[i]t would be wrong to 
suppose that every deviation in a written text is colloquial. Since people know that there is a 
difference between written and spoken language, they make a conscious attempt to write cor¬ 
rectly but in doing so sometimes overreact using forms that are neither colloquial nor standard’ 
Versteegh 2001,115. 

20 The sharing of this kind of linguistic phenomena within the Melkite Arabic tradition, led 
Joshua Blau to spoke about the Melkite Arabic literary lingua franca (Blau 1994; 2002, 72), on the 
same topic see also La Spisa 2012. 
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6 Concluding remarks 

From all the data listed above it is possible to infer some considerations about 
the interactions between the early Arabised Christian communities in the monas¬ 
teries of South Palestine and their Islamic religious environment. The first ques¬ 
tion I would like to pose is: is it possible to speak of new standards in the early 
Christian Arabic texts as compared with the Islamic standards (starting from the 
Qur’an and onward)? Just one century after the Arab conquests the Arabic spell¬ 
ing and grammar were almost the same for Christians and Muslims. Scholars 
having analysed the language of Christian Arabic manuscripts often concluded 
that there existed a so-called Christian Middle Arabic (Blau 1966-67,1994, 2002, 
Grand’Henry 2006). Nevertheless Samir Arbache (2008), who studied the mor¬ 
phological verbal system in the Sinai Arabic 72 codex, which dates back to the 9 th 
century, drew the following conclusions: 

Les textes en moyen arabe ont existe depuis les origines, c’est-a-dire depuis le debut de la 
litterature arabe ecrite. [...] Si tel est le cas, le moyen arabe ne peut plus etre envisage comme 
une transformation ou une regression de l’arabe classique. II sera plutot objet d’analyse 
comme un etat de la langue ecrite au meme titre que le dialecte ou la langue classique. 21 

Only after the normalisation process carried out by the Iraqi philologists of the 
9 th —iO th centuries is it possible to speak of specific palaeographic and linguistic 
choices due to the need to build and consolidate a confessional identity. At the 
very beginning of the Arabisation, the ‘Melkite Arabic’ church distinguished itself 
by choosing the Arabic language, as it was spoken and written by all the Arabic 
speakers in that time, as the official language of the church. It stands to reason 
to believe that in Christian milieus al-'arahiyya al-fusha did not have any liturgi¬ 
cal function or any specific religious meaning as it has in Islam. For this reason. 
Middle Arabic is much more pervasive even in liturgical, theological and patristic 
works of the ‘Melkite Arabic’ church. 

On the other hand, some centuries after the Melkite Church, the other ori¬ 
ental churches living in Egypt and in the Bilad al-Sam, started to produce their 
literary, theological and patristic heritage directly in Arabic because their liturgi- 


21 Arbache (2008, 19): ‘Texts written in middle Arabic have existed since the origins, i.e. since 
the very beginning of written Arabic literature. [...] If this is the case, the Middle Arabic should 
no longer be considered as a sort of transformation or corruption of Classical Arabic. It is rather 
a special variety of the written language which has the same importance as the dialect and the 
Classical language’ (my translation); for further reflections on the same question, see also Bettini 
and La Spisa (2012, viii-xii). 
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cal languages had become incomprehensible to Christian believers. The kind of 
Arabic they used is also called Middle Arabic, but in-depth studies trying to point 
out differences and similarities with the Melkite texts are still lacking. 
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Nuria de Castilla 

Uses and Written Practices in Aljamiado 
Manuscripts 

Abstract: In the Aljamiado texts, we see a consistent system used for the translite¬ 
ration of the Romance language in Arabic script. From the oldest copies preserved 
(15 th century) until the last ones produced by the Moriscos at the beginning of the 
17 th century, a standardised graphic and orthographic system has been applied to 
these Spanish texts written in Arabic script. 


1 Introduction 

After seven centuries of Muslim presence in Spain, Boabdil had to surrender the 
keys of the city of Granada to Ferdinand, the Catholic King, in 1492. Despite the 
‘Capitulaciones de Granada’, the agreement by which the Catholic kings showed 
an apparent kind understanding of the Muslim population (they could maintain 
their clothes, habits and rituals), in 1499, Cardinal Cisneros started to stress the 
necessity of religious unity. 

In 1502 in Castile, and 1526 in Aragon, all the Muslims (the Mudejars) were 
obliged to be baptised (which is to say, they had to convert to Christianity) or to 
leave Spanish lands. The conditions set for exiting the country were so harsh that 
most of the people stayed on in the Iberian Peninsula. If they decided to be bap¬ 
tised and to stay in their lands, they were no longer called Mudejars, but Moriscos 
or ‘convertidos de moro’. But what at the beginning was a religious matter 
became something wider: in 1504, Morisco communities had to pay special taxes 
(because they were converted), and from 1511, cultural differences were no longer 
accepted. Then, from 1516 the typical Morisco clothes were forbidden, as well 
as their music and zambras (dances), and of course, their food: not eating pork 
meant that you were a Muslim; at the same time, the Muslim way to slaughter the 
animals was banned too. 


This chapter is the developed version of the paper presented in the conference ‘Creating stan¬ 
dards: Orthography, Script and Layout in Manuscript Traditions Based on Arabic Alphabet’, held 
in Hamburg in October 2013. The research was carried out with in the fra me of the Coran a, Produc¬ 
tion and transmission of the Qur’an in the Western Islamic World, a project led by Nuria de Castilla, 
and financed by the Spanish Ministry of Science and Innovation (FFI2012-32294), and ERC Grant 
Agreement number 670628 SICLE (2016-2021). 

3 Open Access. © 2019 Nuria de Castilla, published by De Gruytei. |COThis work is licensed 
under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110639063-005 
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In the second half of the 16 th century, it was the turn of the books: in 1564, 
it was ordered to burn all the books in Arabic in Valencia. In 1564, Granadan 
Moriscos were enjoined by a Royal decree to learn Spanish within three years 
(Vincent 2006, 106). In 1567 in Castile, the use of Arabic language, written and 
spoken, as well as the right to own books in this language, was forbidden. Despite 
the interdictions, paradoxically, the greater part of the manuscripts which have 
remained from these Muslim communities were copied during this period, i.e., 
the second half of the 16 th century, when the bans were more intense, and the use 
of the Arabic script acquired specific values of resistance (Bernabe Pons 2010,30). 

Spoken Arabic first disappears in a general way in Castile and Aragon, but 
these two kingdoms conversely witness the use of the Aljamia, a written variant 
of Castilian with specific linguistic features. Arabic letters are usually used, alt¬ 
hough it is sometimes in Latin letters. The manuscripts in this area, hidden in 
the ceilings or in the walls, were discovered in the 16 th and the 17 th century, but 
mainly from the 19 th century to the present day. The most important trove took 
place in Almonacid de la Sierra. ‘In 1302/1884 hundreds of manuscripts came to 
light during building works under the floor of a house in the village of Almonacid 
de la Sierra, close to Saragossa, the capital city of Aragon. The volumes were pro¬ 
bably concealed there shortly before the expulsion of the Moriscos in 1018/1609 
and, when discovered, were well-arranged, standing side by side, together with 
some bookbinder’s tools’ (Martinez de Castilla 2014, 89). Other similar finds took 
place in Sevilla, Pastrana, Cutar, Ocana, Calanda, Torrellas, Novallas, Sabinan, 
Urrea de Jalon, etc.; however, the number of manuscripts discovered in these 
places was smaller. The whereabouts of some of them, like that of Pastrana, are 
unknown (Martinez de Castilla 2016); others are in private hands, as those of 
Ocana and Urrea de Jalon. Most are however kept in public collections: the extant 
manuscripts from the Almonacid de la Sierra trove are nowadays in the Tomas 
Navarro Tomas Library, CCHS-CSIC in Madrid (mainly) and Escuelas Pias of Sara¬ 
gossa; the manuscripts from Calanda are in the Library of the Cortes de Aragon 
(Saragossa), and so on (Villaverde 2010). Manuscripts hidden behind a wall and 
discovered during repair works in a house have been the major source of these 
witnesses of Morisco culture. However, other codices produced in the same area 
of Castile and Aragon were bought by bibliophiles in the 17 th century. This was 
the case of two Aljamiado manuscripts in the Bibliotheque nationale de France 
(Castilla, forthcoming), both having been acquired by Antoine Galland and inte¬ 
grated into the King’s library in Paris in 1686 according to the information coll¬ 
ected by Morel-Fatio (1982, vii). These manuscripts produced by Mudejars and 
Moriscos from Aragon and Castile exhibit two graphic systems (Arabic and Latin 
script) and two languages (Arabic and Aljamia). In this chapter, I will focus on the 
Aljamiado texts (a variant of Spanish, usually in Arabic script). 
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The word ‘Aljamia’ is derived from the Arabic al-‘ajamiyya meaning 

‘non Arabic language’. In our context, it was used among the Peninsular Muslims 
to designate the language used by their Christian countrymen. In the Dicciona- 
rio de Autoridades (1726), it is said that Aljamia is ‘the language or parlance the 
Arabs living in Spain used to speak with the Spanish Christians in order to make 
themselves understood in their contracts and dealings’, 1 i.e. it refers to a koine or 
Romance vehicular language. 2 In the Western texts of the 15 th to 17 th centuries, the 
word Aljamia is systematically identified with the meaning ‘Romance’, probably 
Castilian as can be deduced from the following example. 3 4 

Thirdly, that they should not speak Algarabia, but all should speak Aljamia, and that all 
the documents and contacts which used to be written in Arabic should be carried out in 
CastilianA 

Or, as shown by the following account: 

He was carrying secretly with himself a certain silver coin on which were letters and charac¬ 
ters in Arabic which read ‘ley lehe ele Ala’, i.e. they meant in Aljamia that there is no other 
Lord except God alone. 5 

The same applies to the Catalan translation of Tirant lo Blanc: 

Upon my honour, Sir, accompanied by these Moorish ladies came a very handsome girl who 
spoke very well Aljamia, and with much gracefulness, and if your Lordship would like to 
make me a favour, although I do not deserve it, when you take the city, you would make her 
a Christian and give her to me as wife. 6 


1 ‘La lengua o idioma que para entenderse en sus tratos y comercios hablaban los arabes que 
estaban en Espana con los cristianos espanoles’. 

2 For more information about the contexts in which this word was used during the Spanish 
Golden Age, see Martinez de Castilla 2006b, 235-246. 

3 The edition of the texts and the use of italics in the given examples are mine, as well as their 
translation. 

4 ‘Lo tercero, que no hablasen algarabia, sino que todos hablasen en aljamia, y que todas las 
escrituras y contratos que se solian hacer en arabigo, se hiciesen en Castellano’. Fray Prudencio 
de Sandoval, Historia de la vida y hechos del Emperador Carlos V, 1604-1618, ed. and study by 
Carlos Seco Serrano, in www.cervantesvirtual.com (accessed 2 October 2014). 

5 ‘Ocultamente traia consigo una cierta moneda de plata en que habia en ella letras y caracteres 
en lengua arabiga, que decian “ley lehe ele Ala”, que [en] aljamia querian decir que no hay otro 
senor sino Dios solo’. Diocesan archives of Cuenca (ADC), leg. 262, num. 3573. See Martinez de 
Castilla 2006b, 237. 

6 ‘Por mi fe, senor, en compania de aquellas moras viene una donzella muy graciosa que habia 
muy bien el aljamia y con mucha gracia, y si vuestra senoria me quisiese hazer tanta merced que, 
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In some cases, the adjective Aljamiado is used as a designation of those speakers 
who were fluent in both Arabic and Castilian, or for those who were living in 
aljamas, i.e., in Muslim neighbourhoods: 

In order to identify the aljamiados who did not learn from a child our language and its 
pronunciation, they had them say ‘cebolla’ [‘onion’] and the Morisco would say ‘xebolla’. 7 

Today, when speaking of Aljamia, we are referring to a variant of Castilian exhibi¬ 
ting a series of linguistic features which can serve as a basis for its identification: 

- influence of Arabic, not only in lexicon, but also in morphology and syntax; 

- influence of Aragonese, the Romance linguistic variety used in Aragon; 

- strong archaism. In some cases, it seems that the language belongs to the 
13 th —14 th century rather than to the 16 th century. 

These are perhaps the characteristics Pedro de Herrera was alluding to when he 
asserts, in 1618, that the moro is ridiculous for his language: 

Two by two, as for a fight, they wanted to jostle when a ridiculous Moor (according to his 
garments, his physical appearance and the aljamiado language of his voice), singing, con¬ 
vinced them to relinquish the civil litigations since their revolt was known and that they had 
been ordered to be thrown out of Spain. 8 

However, in spite of the possible influences of the spoken language on this lin¬ 
guistic variety used by Mudejars and Moriscos, the Aljamia was foremost a 
written language. 9 One of the reasons that leads us to maintain this statement 


aunque yo no lo tenga servido, que como tomeys la ciudad la hagays hazer cristiana y me la deys 
por muger’. Joanot Martorell, Tirant lo Blanc, Castilian translation, 1511, ed. by Martin de Riquer, 
Madrid, Espasa-Calpe, 1974, p. 172, see Corde, http://corpus.rae.es (accessed 2 October 2014). 

7 ‘A los aljamiados que no habian desde ninos aprendido nuestra lengua y su pronunciation, 
para conocerlos los hacian decir “cebolla”, y el que era morisco dezia “xebolla”’. Bernardo de 
Aldrete, Antig. de Esp. [Varias antigiiedades de Espana, Africa y otras provincias, Anvers, fuan 
Hasrey, 1614], I, 37. On the basis of this example, Cirot suggests that ‘aljamiado’ may have meant 
‘parlant arabe’ (Georges Cirot, ‘“Ladino” et “aljamiado”’, Bulletin Hispanique 38.4 (1936), p. 539; 
this meaning is already found in the Diccionario de la lengua castellana de la Real Academia 
Espanola de 1770: ‘lengua arabe corrompida que hablaban los moros de Espana’ (s.v. ‘Aljamia’). 

8 ‘Dos a dos, como para renir, querian acometerse, cuando un moro ridiculo (por el vestido, 
figura personal y lenguaje aljamiado de su voz), cantando, les persuadio dexassen las penden- 
cias civiles porque ya se sabia su rebelion y los mandavan echar de Espana’. Pedro de Herrera, 
Translacion del Santisimo Sacramento a la iglesia colegial de San Pedro de la villa de Lerma, 1618, 
in Ferrer Vails 1993, 273. 

9 The Aljamia ‘was probably not a faithful mirror of the variant actually spoken by these people’ 
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is the big linguistic difference between the texts translated from the Arabic - the 
most important part of the manuscripts produced by the Mudejars and Moriscos 
in Aljamia - and those which were a recent creation. When we speak of Aljamia, 
we speak thus about a written linguistic variant of Spanish, with a strong influ¬ 
ence of Aragonese, and in a large part Arabised and Islamised as much from a 
linguistic point of view as for its contents. 


2 The copyist and his milieu : time, place and 
society 

Who produced this kind of codices? Aljamiado manuscripts were copied within 
the Muslim communities in Spain: Mudejars (before the forced baptism in 1502) 
and Moriscos (after this date), from the end of the 14 th century until the beginning 
of the 17 th century. The earliest manuscripts that have come down to us are proba¬ 
bly from the 15 th century, but the majority of the manuscripts date to the 16 th and 
17 th centuries. 

This production was localised in Aragon and Castile, not in Granada or in 
Valence. The various Spanish kingdoms have very different history, and the same 
applies to the Muslim communities living within each of them. Toledo was conque¬ 
red in 1085, whereas Granada in 1492: four long centuries separate the two events. 
One can understand that the Castilian Muslim communities were much more assi¬ 
milated in 1499 (the year when Cardinal Cisneros initiated the policy of instituting 
religious unity in the kingdom of Castile) than the Granadian ones. On the other 
hand, in the 14 th century, which is probably when the Aljamiado phenomenon 
started, Granada was still a Muslim kingdom, and Valencia had been conquered 
one century before. This is the reason why Aljamia did not develop in every kingdom 
in Spain but only in Castile and above all in Aragon. In these kingdoms, Muslims 
had to struggle to preserve the knowledge of Arabic as well as their cultural identity. 

These manuscripts were mainly written (copied or produced) by some uni¬ 
dentified scribes for the faqihs in order to maintain the cohesion of their com¬ 
munities and to control them, at a moment when they were losing their rites and 
cultural practices (de Castilla 2006a). Although we find familiar — or low quality 
copies — probably written within family circles, most of the manuscripts were 
produced by skilled copyists. 


(‘no debia de ser un espejo fiel de la variedad hablada por aquellas poblaciones’). I am indebted to 
Olivier Brisville-Fertin for generously giving me access to his unpublished work-in-progress (p. 5). 
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3 Linguistic features 

3.1 Graphic representation 

Orthographic conventions are quite stable throughout the period concerned by 
this production, as we shall see later. However, the language itself varies accor¬ 
ding to the text typology, but it will remain in any case basically Romance: if 
the text is a translation of an earlier Arabic work, the text will exhibit a higher 
degree of Arabisation and archaism than a copy or adaptation of a European text 
or than a new composition produced in a Mudejar or Morisco context (Sanchez 
1995, 339-348 and Montaner 2004, 99-100). 

Since Spanish has a few phonemes that are not found in Arabic, Mudejars 
and Moriscos developed new graphemes in order to represent them: 

1. Vowels: In Spanish there are five vowels (/a/, /e/, /i/, /o/, /u/), whereas in 
Arabic there are only 3 (/a/, /i/, /u/). For the vowel /e/, the Mudejars adopted 
a new orthographic combination, fatha plus alif, which was inherited by the 
Moriscos. Ex/u^ = <mesa>. For the vowel /o/, there is no innovation, the 
damma being used for both /o/ and /u/. 

2. Consonants: The sadda, a diacritic used in Arabic system to geminate a con¬ 
sonant, is systematically used in order to provide solutions for rendering 
Spanish phonemes, as follows: 

ba’ with sadda = /p/ <p>. 

- jim with sadda = /c/ <ch>. 
ra’ with sadda = /rr/. 

sin with sadda = /s/ <x>, adapted for the voiceless palato-alveolar fricative. 10 
nun with sadda = /n/. 

The various emphatic letters (-h, -h , , o- 3 and j) are in most of the cases used 

for writing down Arabic words, not Spanish ones; examples like the following 
are common: <a‘ssala>, written with o- 3 , but <a?otea>, with lp. However, in the 
Aljamiado texts in Latin script, a <$> is used for both cases: ‘agala’ and ‘a^otea’, 
as the regular way to write these words in the 16 th century Spanish. 


10 Although this way of writing is the most commonly found, there are other manuscripts — 
Poema de Yuguf, the manuscript of Urrea, BNE 5267, BNE 5305, BNE 5313, BRAH T12, BRAH T13, 
BRAH T18, RESC/13, RESC/30, RESC/33, RESC/37, RESC/52, RESC/64, RBME 1880, BPal 3226, BnF 
Arabe 774 or BnF Arabe 1163— in which the letter sin without tasdid is used exclusively to indica¬ 
te both the apico-alveolar and the pre palatal fricative voiceless. 
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3.2 Consonant clusters 

There is a tendency in Aljamia to maintain some of the features found in Arabic, 
such as the syllabic structure. Although some Arabic dialects have initial or final 
consonant clusters, Andalusi Arabic did not allow a sequence of two consonants 
in the same syllable without an intermediary vowel. In other words, the only pos¬ 
sible syllabic structure is CV(C) (Corriente 2002,35). The Aljamia usually introdu¬ 
ced an epenthetic vowel, the same as in the following syllable, between the con¬ 
sonants in order to eliminate the tautosyllabic consonant clusters in the original 
Romance words, for example, ap e remiar, apdagado, atrevimiento. 

According to Quilis (1981, 298-300), in Aljamia, the epenthetic vowel was a 
merely graphic component unlike in Classical Arabic where the epenthetic vowel 
is usually phonetically realised. Nevertheless, this svarabhakti element (automa¬ 
tic vowel) seems to have had a phonetic representation, a conclusion reached by 
Corriente (2000-2001,117) in relation to the botanical glossary of Abulxayr: 

The disjunctive vowel was not a mere graphic tool meant to avoid an aberrant orthography 
within the Arabic script, but a phonic reality due to the interference of the syllable taxono- 
mical rules. 11 

This view, if applied to Aljamia, would entail that it was ‘a tongue phonetically 
interfered with, in this case by Arabic’ 12 (Corriente 2000-2001,117). In the same 
way, Labarta (1982, 231) states in her analysis of an Aljamiado page from Tara- 
zona that 

[t]he Castilian sequences formed by occlusive + liquid + vowel produce between their first 
two components a vocalic element with the same quality as that of the following vowel. 
Such a sound, which does not have for us a phonological value and does not have any 
graphic representation, was perceived and noted in Aljamia either because the Arabic does 
not allow for a syllable to begin with two consonants, or due to the nature of Aljamia itself, 
that reproduces through the Arabic script the spoken Castilian in an ‘acoustic’ (not ‘phono¬ 
logic’) way with assimilations, contractions, and so on. 13 


11 ‘La vocal disyuntiva no era un mero recurso grafico para evitar una ortografia aberrante en 
la escritura arabe, sino una realidad fonica debida a la interferencia de las reglas taxonomicas 
de la silaba’. 

12 ‘una lengua interferida, foneticamente en este caso, por el arabe’. 

13 ‘Las secuencias castellanas formadas por oclusiva + liquida + vocal crean entre sus dos 
primeros componentes un elemento vocalico de timbre similar al de la vocal que lo sigue. Tal 
sonido, carente para nosotros de valor fonologico y de representation grafica, era percibido y 
marcado en la escritura aljamiada, ya como consecuencia de que el arabe no permite que una 
silaba empiece por dos consonantes, ya debido al propio caracter de la escritura aljamiada, la 
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3.3 Sibilant consonants 

Although the graphical difference between /s/ = <_£ and /§/ = is indicated in 
some manuscripts, 14 60% of the corpus analysed does not express this distinction. 15 
This implies that this phonetic difference is not so pertinent in Aljamia as has 
been argued 16 and it is not rare to find sarabe instead of xarabe (‘syrup’) - which 
could be explained by the Arabic etymology of the word sarab (‘beverage’) -, 17 
or diso instead of dixo (‘he said’). The cases which exhibit this graphical distinc¬ 
tion could be interpreted as follows: either this is the result of the realisation of a 
phoneme non-existent today, or the copyist was very familiar with the texts in Latin 
script and tried to emulate them in Arabic script, introducing the shadda above the 
sin for the <x>. 18 As Montaner (2004,100, n. 2) accurately indicated, ‘except for the 
limited influence of the Andalusi substrate on the phonology and the lexicon, the 
Alj amia never stopped being a Romance language, even in its most Arabized texts’. 19 


3.4 Dental consonants 

The most common letters representing the dental consonants in Aljamia are \ i, 
including the emphatic consonants -L / This is probably the result of a plausi¬ 
ble loss of consonantal emphasis among Mudejars and Moriscos (Bouzineb 1986, 
30-31). There is a distinction between the fricative and occlusive allophones [3] 
and [d], but it is not so common. This distinction is more regular in later man- 


cual, mediante la gratia arabe, reproduce el Castellano hablado de una forma “acustica” (no 
“fonologica”), con sus asimilaciones, contracciones, etc.’ Nevertheless, when reading her study, 
one has the impression that this ‘vowel’ will always be found in the Tarazona folio. However, 
this is actually not the case in graphic representation of the word libras in BRAH T19, where a 
sukun appears between the /b/ and the /r/. On the other hand, this epenthetic vocalic element 
occurs in the other three words that have consonantal groups (gruesa, tres y maestro ). For more 
information about the distribution of the epenthetic vowel and suldin, see Martinez de Castilla 
2010,182-187. 

14 BRAH T13, BRAH T18, BRAH T19, BNE 4953, BNE 5223, BNE 5377, BNE RES 245, RESC/62. 

15 That is to say, Poema de Yu^uf, Urrea manuscript (particular collection), BNE 5267, BNE 5305, 
BNE 5313, BRAH T12, BRAH T13, RESC/13, RESC/30, RESC/33, RESC/37, RESC/52, RESC/64, RBME 
1880, PAL 3226, BnF Arabe 774, BnF Arabe 1163. 

16 Galmes de Fuentes 1970, 220-221, and several studies since then. 

17 Steiger 1991, 53, n. 3. 

18 More information in Martinez de Castilla 2010,177-178. 

19 ‘salvo la limitada accion del sustrato andalusi en la fonologia y el lexico, la aljamia sigue 
siendo netamente una lengua romance, incluso en sus textos mas arabizados’. 
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uscripts, and a possible phonological interference by the Andalusi seems less 
likely than the cases with the sibilants as in the example of /sarabe/ above). As 
described by Navarro Tomas (1918, 99), ‘the occlusive articulation is produced 
when the dental consonant appears [...] after a “n” or “T”. For example, in the 
manuscript BRAH T19, undated but copied at the beginning of the 17 th century 
(Castilla 2019), we read on the one hand: 


el dia 


(24r, 6) 

al dueno 


(112v, 3) 

y-el dayuno 


(182 r, 8) 

en desierto 


(84r, 15) 

en dineros 


(122r, 4) 

en demandarlo 


(140v, 4) 


And on the other hand: 

liil 

jt-N.tnUa (jli 


adonde 

conpania del mensajero 
delante de los onrados 
a doze dias de la luna 


( passim ) 
(lr, 2-3) 
(lr, 9) 

(2Or, 9-10) 


4 Contents 

As far as the contents of the manuscripts are concerned, most of these codices 
are miscellanies - or multiple-text manuscripts -, but there are also composite 
and unitary copies. Notably, we can find copies of excerpts of the Qur’an, hadith 
(traditions attributed to the Prophet Muhammad), fragments of juridical texts, 
magic, and so on. Exceptionally, there are some documents written in Aljamia 
(personal letters and legal documents), but unfortunately only few items have 
been preserved (see, for instance, Hoenerbach 1965, Labarta and Garcia Carcel 
1981, Viguera 1982 or Viguera 1991). 

The Alj amiado texts are markedly Islamic and this applies to both Mudejar and 
Morisco periods. The production was carried out during an earlier stage mainly in 
Aragon, in Castile, and later, as far as we can establish, in Algeria, Tunisia and in 
the heart of the Ottoman territories, in Salonica and Constantinople, during the 
Morisco period and after the expulsion. It can therefore be defined as an ‘Islamic 
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literature made [...] by Muslims and for Muslims’ 20 (Bernabe Pons 2010, 27). It is 
‘an Islamic variant of Spanish’, as Ottmar Hegyi (1985) aptly defined it. 

The manuscripts can be divided into three main groups: a) translations of 
earlier Arabic texts, the date of translation remaining unclear; 21 b) copies and 
adaptations of Western-European texts; and c) new works. A taxonomic distinc¬ 
tion has been traditionally maintained between the Aljamiado manuscripts in 
Arabic letters and those in Latin script, the former being called ‘Aljamiados’ and 
the latter ‘Moriscos’. For this reason, several mentions of ‘manuscritos aljamia- 
do-moriscos’ appear in specialist publications dealing with this production, both 
in articles and catalogues; this is a descriptive phrase that allows us to jointly 
treat copies written in Mudejar and Morisco communities where the two scripts 
are used. 22 However, such description seems confusing since when we find a man¬ 
uscript in Arabic script and another one in Latin characters containing the same 
text(s), the differences between both are in most cases strictly graphic. In this 
way, neither the state of the language of a given Aljamiado text, nor its place of 
production can be defined on the basis of its script (Latin or Arabic characters). 23 
For instance, ‘The story of Abu Sahmah when his father, ‘Umar, sent him to be 
whipped’ 24 is found in various Aljamiado codices (Fig. I): 25 they are all in Arabic 
script, with the exception of BNE 6016, written in Latin letters. BNE 6016 belongs 
to the same textual tradition as the three other witnesses of the text which have 
been preserved in the manuscript of Urrea de Jalon, BRAH T12 and BnF Arabe 774. 


20 ‘Una literatura islamica, hecha [...] por musulmanes y para musulmanes’. 

21 Although the knowledge of Arabic was almost completely lost in Castile and Aragon, re¬ 
search over the past few years have been throwing light on the possible use, albeit in a minority 
way, in the Aragonese area. Cf. Ferrando 1996, 177-195; Ferrando 2000, 195-200; Garcia-Arenal 
2010, 295-310. 

22 As an example, it will be sufficient to mention publications like Galmes de Fuentes 1998; 
Galmes de Fuentes 1986; or Vespertino 2002-2004. 

23 Although in initial studies of the Aljamiado literature it was assumed that there was a correla¬ 
tion between the use of the Arabic script and the Aljamiado texts written in the Iberian Peninsula 
on the one hand, and between the use of the Latin characters and the production in the exile 
after the expulsion, texts written in Latin script in the Iberian Peninsula and conversely others 
written in Arabic outside the Peninsula have been found. Cf. Suarez 2004, 20 and 21 and the 
bibliography cited there. 

24 ‘El hadiz de Abu Xahma cuando lo mando a^otar su padre Omar’. 

25 Taken from Martinez de Castilla 2010, 308. 
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Fig. 1: Genealogical textual schema of ‘The story of Abu Sahmah’. 


In some cases, we know that an Aljamiado text in Latin script could have been 
transcribed from a copy in Arabic characters (Fig. 2). This is the case of the 
‘Prayer for the rite of the child’s fadas’ (i.e. the Muslim rite for giving a name to 
a newborn) 26 in BCM T232, in Latin letters, which was apparently copied from 
BRAH T19, unless both manuscripts stem from the same common exemplar 
(Martinez de Castilla 2010, 344). 


I "Uli „ 


i . - n 

^ 9x4**-- 

I*.*,**- ****Jp/tML- /no, 

(/ 


Fig. 2: Fragment of the ‘Prayer for the rite of the child’s fadas’. Left: BRAH T19, fol. 24v. Right: 
BCMT232, fol. 295r. 


Also, in a few instances Spanish texts have been transliterated in Arabic script, 
either faithfully to the original or with changes. This way, a text produced in 
another culture and widely spread, serves as a basis either for a transliteration 
in Arabic script, as was done by the copyist of The sayings of the seven wise men 
from Greece (Dichos de los siete sabios de Grecia), 27 or for an adaptation to an 


26 ‘Rogaria para las fadas de la criatura’. 

27 Edited by Galmes de Fuentes 1991,47-54. See also Castro (ed.) 1990,407-410 and Ramos 2012,848. 
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Islamic cultural context: this is the case with The Handless Maiden (La Doncella 
de las manos cortadas), a well-known legend in Middle Ages and still circulating 
in the oral European tradition (Thompson (1970, T411.1). 28 About the latter, Mary 
Elizabeth Perry (2005, 97) stated that ‘the story of Carcayona reads as a Muslim 
version of the Handless Maiden tale’. 

The language of the translated texts and of the new ones is therefore diffe¬ 
rent. The new texts use a language closer to the “standard” use of the Christian 
Spanish texts than the copies of the translations, much more influenced by the 
Arabic; in other words, the use of the Arabic alphabet does not induce any special 
differentiation in the kind of language employed in these new Morisco works. 
Here is an example of a text written in Arabic script, produced in the 16 th century, 
to help the Moriscos to leave Aragon in order to arrive safely to Salonica: 

[Itinerary from Spain]. [...] From there to Verona. Do not go through the city as you would 
pay a real per head. There you will ask for the road to Padua. There you will take a boat to 
Venice; from Venice to Bolonia [Bologna] or to Durazzo, or to Lesos or to Castelnou, that of 
these parts you will find first. You will enter an inn; you will pay half a real per day; and do 
not take anything from the inn as they will charge you triple. 29 

The language of the transliterated texts of Western European origin does not 
change either: 

[Sayings of Bias]. These are the sayings of Bias, and they are as follows: and in order to be 
well understood, the reader should think that each Sage is speaking with him: Look at your¬ 
self every day of your life in a mirror, take this advice from me. 30 

On the other hand, the texts resulting from a translation of an Arabic original 
exhibit a different state of the language (Fig. 3): 

The hadith of Abu Sahmah when his father, ‘Umar, God may be pleased with him, sent him 
to be whipped. ‘Abdallah b. ‘Umar said: ‘Umar b. al-Khattab had a son called Abu Sahmah 


28 Valero Cuadra (ed.) 2000; Perry 2005. See also Martinez de Castilla 2006c. 

29 BnF Arabe 774, fols 37v-38v. My own transliteration and translation. The text has been edited 
by Sanchez Alvarez 1982,153-154. ‘[Itinerario de Espana]. [...] De alii a la Verona. No paseis por de 
dentro de la ciudad, que pagares a real por cabe^a. Alii demandares el camino para Padua. Alii 
os embarcares para Venecia; de Venecia para la Bolona [sic] o para Duracio, o para Lesos o para 
Castelnou, el que antes halles d’estos puertos. [...] entrares en una posada [...]; pagares medio 
real por dia; y no tomeis nada de la posada, qu[e] os aran pagar de uno tres’. 

30 BnF Arabe 1163, fol. 60v. Edited by Galmes de Fuentes 1970, 41. ‘[Dichos de Bias] Estos son los 
dichos de Bias, los cuales son los siguientes: I para ser bien entendidos, piense el le[c]tor que cada 
sabio habla con el: Mirate todos los dias que vivieres al espejo; toma de mi esti consejo’. 
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who was a reciter of the Qur’an and when he was reciting it seemed that the Messenger of 
God was reciting. And Abu Sahmah fell ill, with a very bad illness, and the colleagues and 
friends of the Prophet Muhammad were visiting him. One day, when he went there, they 
gathered in ‘Umar’s house and told him: ‘0 prince of the Faithful! If you made a promise, as 
did ‘Ali b. Abu Talib for al-Hasan and al-Husayn until God restored them to health, maybe 
God would restore to health your son Abu Sahmah and he would be healthy’. 31 
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Fig 3: Bibliotheque Mejanes 1367, Aix-en-Provence, fol. 213v. 


31 Aix 1367, fol. 213v. Edited by Martinez de Castilla 2005, 507. ‘El hadiz de Abu Xahma cuan- 
do lo mando aqotar su padre Omar, acontentese Allah d’el. Dixo Abullahi ibnu Omar, apaguese 
Allah d’el: Era Omar ibnu Elhatab que tenia un fijo que se llamaba Abu Xahma, y era Uedor del 
Alcoran, que semejan en su leer al leer del mensajero de Allah (s’m). Y enfermo Abu Xahma una 
enfermedad muy fuerte; y era la conpania del anabi Muhamad (s'm) que lo visitaba. Pues cuando 
fue un dia, ajuntaronse en la casa de Omar y dixeronle: “Ye rey de los creyentes, si prometieses 
una promesa, asi como hizo Ali ibnu Abi Talib por Alhaqan y Alhugayni hasta que les invistio 
Allah la salud, por ventura que tu fijo Abu Xahma que le daria Allah salud y estaria luego sano’”. 
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We could thus conclude that the archaic state of the last text, as well as that of the 
other translations from the Arabic, can be at least partly explained by the conser¬ 
vative nature of the translations, some of them made probably in the 13 th century, 
and copied again and again. However, even in the translated texts — the texts of 
Arabo-Islamic origin — we appreciate the influence exerted by the “standard” 16 th 
century Spanish, used by the Christians. Then: 

Some copyists were aware of the linguistic trends of their times. This explains 
the presence of learned and literary words as well as Latinisms, related in 
almost all cases to specialised botanical vocabulary: junqueruela ‘reed’, sis ha 
‘fruit of the jujube tree’, pollicios ‘sprouts’ or ixola ‘prickly and bitter plant’ — 
in some cases they come from Latin through Arabic loans. I have also found 
a few hapax — words for which there is no documentary evidence before in 
any other place, in Christian or Muslim context, as revida (‘another life’) or 
hoticaxear (‘put makeup on someone). 32 

From a morphological point of view, some copyists used the synthetic super¬ 
lative in -isimo, in words like noblisimo, very common in Spain at the end of 
the 16 th century but never found before that time. That period coincides with 
the last copies in Aljamia, which strongly suggests that the copyist had the 
knowledge of the linguistic uses of their time. 

In these cases, it shows that the copyists were educated persons, attentive to the 
various linguistic innovations of their time and more used to Spanish texts in 
Latin script than in Arabic script. They probably knew Arabic quite well, without 
being well versed in the language since many mistakes are found in the Arabic 
fragments. In general, the texts in Arabic show more errors than the Aljamiado 
texts, and in many cases, like BnF Arabe 447, the orthography in Latin script is 
more accurate than in the Arabic one. 

Apart from the use of a specialised vocabulary and application of the new 
morphological derivations found in the rest of the contemporary Spanish lite¬ 
rature, other features, such as divisions of words and phrases, help us to detect 
some copyists’ knowledge and familiarity with Spanish texts in Latin script. Thus, 
while in the Middle Ages every kind of grouping of words is found, in the 16 th and 
17 th centuries the graphic agglutination is more limited, and we only find phrases 
like enella o d’ellos (= preposition + article) written as a single graphic group. On 
the other hand, the divisions of the words at the end of a line seem to be random 
in the Middle Ages. However, in modern times, the 16 th and 17 th centuries, the divi- 


32 The Arabisms found in Aljamia often play the role of the Latin loan words in the contemporary 
Christian prose (Montaner 2004, 99-204). See Martinez de Castilla 2010, 210-218. 
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sion (when it occurs) tends to be syllabic: des-/pues ‘afterwards’, and is not only 
found at the end of a line, but at the end of a folio (recto or verso) too, even if in 
Arabic this segmentation is not possible. 

Then, we have to take with caution some conclusions advanced by Hegyi 
(1981, 22), and widely accepted until now, that a) the Aljamiado manuscripts 
were produced by isolated individuals whose only concern was to preserve the 
religious tradition for the next generations; and b) due to their social exclusion, 33 
they kept an archaic language, far from any linguistic innovation, because they 
‘do not share the linguistic ideal of the Renaissance, and they stay on the side¬ 
lines of the linguistic movement which heightens the Spanish language through 
the cultisms from Latin and Italian’. 34 


5 Conclusion 

In the Aljamiado texts, we see a consistent system used for the transliteration of 
the Romance language in Arabic script. From the oldest copies preserved (15 th 
century) until the last ones produced by the Moriscos at the beginning of the 17 th 
century, a standardised graphic and orthographic system has been applied to 
these Spanish texts written in Arabic script. However, it is impossible to assert 
when this system started to be used, because of the lack of evidence from more 
ancient manuscripts. On the other hand, with the exception of a later Aljami¬ 
ado copy of the end of the 17 th or beginning of the 18 th century, kept in a private 
collection in London, no hint of the possible use of other transliteration systems 
has been preserved, which impedes us from suggesting any hypothesis about the 
period of use and the selection criteria of one or another system. 

Concerning the language, the standardisation process is double in Aljamia: 
on the one hand, the translations of the Arabic texts show a peculiar version 
of the Spanish language, characterised by a higher proportion of Arabisms (in 
lexicon, syntax and morphology), Aragonese layer and archaisms. These three 
features, very consistent throughout the Aljamiado production, show a conserva¬ 
tive stage of the Spanish language - probably artificial - in the 16 th century, the 
period when most of the preserved manuscripts were copied. However, this con- 


33 ‘Las condiciones socioculturales no permiten la plena participation del grupo minoritario en la 
cultura oficial del grupo mayoritario, impidiendo que las novedades lingiiisticas se filtren hasta ellos’. 

34 ‘No comparten el ideal lingiiistico del Renacimiento [y, por extension, de conocimientos e in¬ 
quietudes], y se quedan al margen del movimiento lingiiistico que enriquece la lengua espanola 
por medio de cultismos procedentes del latin y del italiano’. 
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servative stage does not result from a deficient knowledge of the contemporary 
uses of Spanish. Moriscos were aware of the standardised use of the language of 
their time, and it is clearly demonstrated in the faithful and correct translitera¬ 
tions of Christian texts in Arabic script; or even more, they do use this normalised 
Spanish language to produce new texts (rather than copies of translated Arabic 
texts). In most of the cases, it is impossible to ascertain only on the basis of the 
language itself whether those texts were written by a Morisco. 

With the exception of the exact transliteration of some fragments of literature 
that were very popular in contemporary Europe, Aljamiado manuscripts show a stan¬ 
dardised kind of content: an Islamic one. The texts are either directly related to the 
religion, such as prayers, Qur’anic passages, Islamic law, exegetical commentaries, 
polemical texts about religion, etc., or indirectly, such as edifying literature, magic, or 
entertaining literature. The main part of this production shows a consistent common 
thread: it gives an easy access to the basic knowledge of what Islam means and what 
is a good Muslim is required to do. The information can be provided in Arabic letters 
(in most of the cases) or Latin script. Even if many hypotheses about the reasons of 
the use of one or the other alphabet have been argued (period, place, familiarity with 
the Arabic language and script, cultural identity, etc.), we still lack a definitive expla¬ 
nation (if this is possible at all) about the selection of one or the other script system. 
Although they are different, both are used in a consistent and standardised way. 


Abbreviations 

Aix Bibliotheque Mejanes, Aix-en-Provence 

BCM Biblioteca de Castilla-La Mancha, Toledo 

BNE Biblioteca Nacional de Espafia, Madrid 

BnF Bibliotheque nationale de France, Paris 

BPal Real Biblioteca. Palacio Real, Madrid 
BRAH Biblioteca de la RealAcademia de la Historia, Madrid 
RBME Real Biblioteca del Monasterio de San Lorenzo de El Escorial, Madrid 
RESC Biblioteca Tomas Navarro Tomas, Consejo Superior de Investigaciones Cientificas/ 
Centro de Ciencias Humanas y Sociales, Madrid 
T Biblioteca de la RealAcademia de la Historia, Madrid (old call number) 

Urrea private collection (manuscript found in Urrea de Jalon), Saragossa 
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Jan Schmidt 

How to write Turkish? The Vagaries of the 
Arabo-Persian Script in Ottoman-Turkish 
Texts 

Abstract: The chapter discusses the development of the orthography of Turkish 
texts in, what the Turks usually call ‘the old script’, i.e. the Arabo-Persian alphabet 
in its various forms as it appears in texts handwritten, printed and engraved, which 
were produced in the Ottoman Empire and the Republic of Turkey before 1928. 


1 Turkish (Turkic) texts before the Ottomans 

Turkish has been written since the 8 th century, perhaps even earlier, although no 
texts have survived from this earlier period. The first surviving texts were written 
in a ‘runic’ alphabet found on inscriptions in stone and on paper in Central Asia. 
It was based on the Aramaic alphabet in its Iranian form, possibly together with 
some letters adapted from the Greek (Hephtalite) alphabet, as Sir Gerard Clauson 
has suggested (Clauson 1970). This alphabet was in use for two centuries and was 
mainly employed for monumental inscriptions by local rulers who were part of 
the Goktiirk (‘Celestial Turkish’) confederation. 

With the rise of the Uighur Khanate (principality) from 742, a second alpha¬ 
bet, or rather a group of alphabets, came into existence. This used a cursive script 
that had its origin in the cursive Sogdian script, 1 which, in turn, was based on the 
Aramaic script. The script was mostly used for Christian, Manichaean and Bud¬ 
dhist texts. In 763, Manichaeism, spread by Sogdian preachers, was declared to 
be the state religion by the Uighur Khan, Bogii Kagan; after the fall of the Uighur 
Empire in the 840s, the Uighur Turks moved to the oases of eastern Turkestan. 
Some of them converted to Buddhism (and later to Islam). In the 13 th century, 
Uighur Turks still played an important role as clerks, teachers and even ministers 
in the Mongol Empire, where the Uighur script also began to be used for Mongol 


I would like to take the opportunity to thank Barbara Flemming and Edith Ambros for their help 
and useful advice with this paper. 


1 The Sogdians were an Iranian people from Central Asia, best known from their role as mer¬ 
chants on the Silk Road. 
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texts. Unlike the runic script, the Uighur script was not forgotten after the politi¬ 
cal power of the people who used the script had waned, but kept its prestige, even 
once also after most Turks had converted to Islam. 

Surviving copies of an important Turkish text of the early Islamic period 
entitled Kutadgu Bilig (‘Fortune-bringing Knowledge’), a work in the ‘mirror of 
princes’ genre from the 11 th century written by Yusuf Khass Hacib of Balasagun 
for the Karakhanid prince of Kashgar, were not only written in Arabic, but also in 
the Uighur script. Later the script was still occasionally used in the chancelleries 
of the Timurids and Ottomans. The latest text in the script was perhaps the ‘pro¬ 
clamation’ meant as a political statement to other rulers of Turkish and Jingizid 
descent, in which the people of the vilayet-i Rum (Anatolia) were notified of the 
defeat of the Akkoyunlu prince Uzun Hasan at the hands of Sultan Mehmed the 
Conqueror in 1473 at the battle of Tercan (Otlukbeli). The text is preserved in the 
archives of Topkapi Palace in Istanbul. 

With the adoption of Islam, the Arabic alphabet began to be used for texts 
in the Turkic languages of Central Asia. This was particularly stimulated by the 
voluntary conversion to Islam of the aforementioned Karakhanid dynasty in 960 
(the dynasty ruled from 840 to 1212 in Turkestan). During their rule, Mahmud al- 
Kashgari, possibly a member of the ruling dynasty, wrote a work on the Turkish 
language in Arabic for the Abbasid Caliph in Baghdad, which was entitled Diwan 
Lughat at-Turk (‘Compendium of Turkish Dialects’). This was in the 1070s. 

In the period between the 11 th and 13 th century, Turkish (Turkic) hardly 
appeared at all as a written language in the Islamic domain. This changed in the 
13 th century, albeit hesitantly at first, when Turkish (Turkic) began to be written 
again in Anatolia, where the Ottoman state had its cradle during the later years 
of the Rum-Seljuk sultanate. The Seljuks, despite their Turkic background, had 
used Persian as their official language ever since it had begun to dominate the 
area in the 11 th century. The power of the sultanate slowly crumbled and a number 
of small principalities arose, including the Ottoman realm, which ultimately took 
its place. Its rulers began to use Turkish - or, to be more specific, ‘West Oghuz 
Turkic’, the language spoken by the Turks who had migrated south-westwards 
into the area over the previous centuries - as their language of administration. 
‘There was then’, in the words of Celia Kerslake (Kerslake 1998,179), ‘a concomi¬ 
tant upsurge of Turkish literary activity, much of it religious-didactic in character 
and often consisting of translation or adaption from Persian’. In the early period, 
before about 1300, this literature still showed evidence of linguistic influence 
from the older Karakhanid literary tradition, but that impact disappeared soon 
afterwards. What these early texts shared was a tendency to use - in the Uighur 
way - a spelling which employed very few vowels, whereas later Ottoman texts, 
appearing after approximately 1500, were written in a more explicit spelling. 
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2 Turkish texts in the Ottoman Empire 

With the transformation of the Ottoman principality, which was founded around 
1300, into a regional empire ‘the Ottoman court and ruling class acquired a con¬ 
sciousness of imperial power [and] certain stylistic registers emerged - in which 
[written Turkish] was all but submerged beneath a heavy overlay of Arabic and 
Persian elements. These reflected the dominance of Arabic and Persian in the 
Islamic high culture and learning which the Ottoman elite sought to emulate and 
advance’ (Kerslake 1998,179-180). These Arabic and Persian elements were not 
restricted to lexical features but included grammatical and syntactic ones as well. 
This went so far, in fact, that some works penned in this elevated literary style in 
the 17 th century, when the development reached its peak, look like Persian texts 
with a Turkish conjunction or verb here and there. The spelling of these Arabic 
and Persian elements remained unchanged, but in some popular texts written by 
less well-educated people (and which often had their origins in oral literature), 
one sometimes finds orthographic adaptations based on the pronunciation of 
such foreign elements by Turks (see below for an example). 

As for the orthography of the Turkish elements, no consensus was reached 
among writers and copyists about the right form, so it remained inconsistent. 
Only during the period of reform in the 19 th century when manuscripts were repla¬ 
ced by printed texts were the first attempts made ‘to define and regularize the lan¬ 
guage’, in particular for didactic purposes. Reform also meant reform of public 
education, and Turkish became part of the curriculum in elementary and secon¬ 
dary schools for the very first time. The language was now specifically called 
‘Ottoman Turkish’ (Kerslake 1998,180) and developed into something resembling 
standard Turkish. 

So, unlike Arabic, literary Turkish was not pressed into a ‘classic’ or ‘stan¬ 
dard’ Turkish at an early stage. The spelling of the Turkish elements in the lan¬ 
guage in the older period was influenced by various complicating (and little-stu¬ 
died) factors, such as regional differences in the spoken language. The expansion 
of the Ottoman state and the migration of Turkish-speaking subjects within its 
expanding borders, which came to include North Africa, the Near East and south¬ 
eastern Europe, meant that the kind of Turkish spoken in Niksar in 1500 must have 
been different from Turkish spoken in Sarajevo the same year. Another factor was 
the ‘natural’ development of Ottoman Turkish: Turkish spoken in Niksar in 1500 
certainly was different from Turkish spoken in the same town in 1820. Change in 
the language, such as the gradual change in the system of vowel harmony, did 
not always lead to a change of spelling (a phenomenon also known in English 
and other languages), hence the orthography of late Ottoman Turkish contained 
a number of archaisms (see below). 
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To a small extent, spelling was also influenced by the non-Ottoman Turkic 
language Chagatai (literary eastern Turkic), represented for example by the works 
of the Timurid scholar and poet Ali Shir Nawa’i (15 th century), and literary Azeri 
Turkish also had an impact, represented first and foremost by the poet Fuzuli 
(16 th century). These poets were admired by Ottoman literati and their work was 
even considered to be part of the Ottoman literary canon by some of them. A 
number of odd particularities in the spelling of certain Ottoman Turkish suffixes 
have been attributed to this influence. 

These elements complicated the standardisation of Ottoman orthogra¬ 
phy, but that is not to say that chaos ruled. By approximately 1500, when old 
Ottoman Turkish had developed into early modern (‘middle’) Ottoman Turkish, 
a preferred way of writing the language adhered to by most authors and scribes 
emerged. In the 16 th century, the court in Istanbul (established in 1453) became 
the dominant centre of literary culture (Hazai 2012). Perusing Ottoman manu¬ 
script texts written or copied between, say, the late 15 th and early 20 th centuries 
makes it abundantly clear that, with a few exceptions, the spelling of the Turkish 
lexical elements is quite consistent and variation is only marginal. The excep¬ 
tions here, again, are mostly the few surviving manuscripts produced by less 
well-educated authors/scribes. As Mehmet Yasti has shown, in such literature, 
there could be a wide range of orthographic inconsistencies even within one 
manuscript. In a copy of a work entitled Esraril l-Arifin, a didactic piece on mys¬ 
ticism in simple Turkish by a certain Seyfiillah Kasim el-Bagdadi, preserved in 
a copy made in 1498 and kept in the Siileymaniye Library in Istanbul, the word 
ugmak (‘to fly’) was spelt in four different ways and the word zira (‘because’) 
in as many as five. The letters dm (jr) and gim (jr) were mixed up - one finds 
both gece and geqe (‘night’), for instance. Emphatic and non-emphatic conso¬ 
nants could be alternated: the word su (‘water’) appeared either with a sin (<_>■) 
or a sad ( l)- 3 ), defective beniim (A) is found alongside plene benum (<*_«), etc. 
(Yasti 2010). Perhaps the need for consistent spelling was felt less by the author, 
because he vocalised the text completely and therefore reading problems hardly 
existed. An interesting later example is provided by the Leiden manuscript Or. 
1551, the memoirs of a mercenary soldier composed in 1834, which survives in 
a unique but incomplete copy in a codex format that is different from ‘normal’ 
Ottoman texts (see Fig. 1). There are strong clues in the text indicating that the 
author, who may have been illiterate, dictated it to a scribe. 
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Fig. 1: Title page of the Leiden MS Or. 1551. © Courtesy of the University Library Leiden. 


The spelling of the Turkish words and the Arabic and Persian loanwords is highly 
erratic. Here are some examples (the column on the right gives the modern spel¬ 
ling in Latin script): 


MS Or. 1551 


Normal spelling 

Modern Turkish spelling 


<bk> 

*4h <byk> 

bin 

‘thousand’ 


<tfrwj> 

(Ar.) <tfrj> 

teferruc 

‘inspection’ 


<wyl’yty> 

ij^y j (Ar.+T.) <wl’yty> 

vilayeti 

‘province’ 


<t?lf> 

(Ar.) <t?lyf> 

te’lif 

‘author’ 
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Fig. 2: Opening pages of Usul-i imla. © Courtesy of the University Library Leiden. 


During the period of reform, especially after 1908 when the Young Turks came to 
power, attempts were made to formalise the existing tendency to standardisation 
by publishing spelling manuals. 2 At least one of these was issued by the Ministry 
of Education on behalf of the Grammar and Orthography Committee (in 1917-19). 3 4 
An earlier example, probably the first of its genre and intended for use by profes¬ 
sional clerks, was Mehmed Rasid’s Usul-i imla (‘Principles of Orthography’) from 
1886 (Fig. 2). 4 In the introduction, the author expresses his hope that the book 
will help the reader to avoid using idiosyncratic spellings of Turkish words and 
making spelling mistakes in Arabic and Persian loanwords; these would harm 
their professional reputation and even put them to shame (‘ muharririnin fazl u 
kemaletine elbette nakise tira§ eder... belki hacaletini mucib oluf, pp. 3-4). To 
illustrate a particularly dramatic example of the way in which Arabic loanwords 
used to be misspelt, he presents a list of 48 orthographic variants of the word 
ihzar (‘preparation’) in the Preface, only one of which was correct (p. 7). 


2 See under ‘imla’ and ‘Usul-i imla’ in Ozege 1971-79. 

3 Analysed in Giimus 2008. 

4 The year is mentioned in the Leiden University Library catalogue; Ozege’s bibliography does 
not mention a year of publication, nor is one found in the book itself. The author, as he explains 
in the introduction, was a teacher of calligraphy and orthography at the Naval Academy (Mekteb- 
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3 The orthography of Ottoman Turkish 

So far, we have looked at some of the major tendencies in the development of 
Ottoman Turkish orthography. But how did the Turks adapt the Arabo-Persian alpha¬ 
bet to spell their own language? One problem here was that the Arabic script, like 
the Aramaic-based Uighur script, contains, to put it simply, too many consonants 
and too few vowels to do the job well. Basically (and ignoring historical, regional 
and other nuances for a moment), Ottoman Turkish contains twenty consonants and 
eight vowels, whereas the Arabo-Persian script has 31 consonantal graphemes and 
three vowel graphemes (the latter also functioning as consonants). Aesthetic value 
aside, in the words of Geoffrey Lewis (Lewis 2010,27), ‘there is nothing to be said in 
favour of the Arabo-Persian alphabet as a medium for writing Turkish’. There were 
various ways of dealing with the lack of vowels: (1) not to write any at all, (2) to use 
the three main (long) vowels in Arabic, alif, waw and yd’, to indicate a/e, o/6/u/ii and 
i/ay/ey respectively, or (3) to use superfluous ‘emphatic’ (velar) consonants for indi¬ 
cating unwritten ‘back vowels’ (a, o, u and l) and non-emphatic consonants for indi¬ 
cating ‘front vowels’ (e, 6, ii, i). The matter was complicated further by the letter kaf, 
which could represent g, k, n, v or y (g and n in more plene texts could be specified 
by an extra slanting dash or three dots respectively). As a result, and in conjunc¬ 
tion with the occurrence of a plethora of Arabic and Persian loanwords in most texts, 
many equivocal readings were possible: g j' may be read as Turkish ulu ‘great’, ulu 
(Arabic: ‘possessors’) or olii ‘dead’, evli ‘married’, avlu ‘courtyard’ or avh ‘stocked with 
game’. Jj- can be dol ‘progeny’, dul ‘widowed’ or dilvel (Arabic: ‘states’). JS can be 
gel ‘come’, gill ‘smile’, kel ‘scabby’, kel (Arabic: ‘lassitude’), kill ‘ashes’, killl (Arabic: 
‘all’), gil (Persian: ‘clay’) or gill (Persian: ‘rose’. More reading problems arose because 
scribes and printers were not always careful about word divisions: ^jj could stand 
for bu sene ‘this year’ or Bosna ‘Bosnia’, for instance. 5 Retention of archaic spellings 
or adoption of Chagatai usage caused words like jj' to be read as iyi ‘good’ long after 
ceasing to be pronounced ‘eyii’ (the word evli just mentioned belonged in the same 
category, being spelt <evlw>) and ls jjjS ( kopri , ‘bridge’) was to be read as koprii. 

In order to fix some of the existing drawbacks, from the middle of the 19 th 
century onwards, ideas were launched for modifying the alphabet and/or chang¬ 
ing the orthography of certain words, particularly by writing more plene, that is, 
using more letters per syllable, and more phonetically, e.g. by replacing eyii by iyi 
( l) and kopri by koprii ( jjjj^). 


i Bahriye) in Istanbul and had composed the treatise for his students. He claimed that it was the 
first publication of its kind. 

5 These examples are taken from Lewis 2010, 27-28. 
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Fig. 3: ‘S implified’ spelling initiated by Enver Pasha, early 20 th century. The two illustrated 
words on the top right-hand page are havlu ‘towel’ and hokka ‘inkpot’. (From M. §akir 
Ulkutasir’, Ataturk ve harf devrimi, Ankara 1974). 


In order to overcome the shortcomings of the script, the statesman and intel¬ 
lectual Miinif Pasha suggested in 1862 that diacritics - both existing and newly 
invented ones - should be used to compensate for the deficiencies in existing 
spelling or that the letters should be written separately, which would avoid the 
need for more than one form of a letter. A year later, the Azerbaijani scholar 
Feth-Ali Ahundzade held a public lecture in Istanbul in which he proposed 
to invent new letters to indicate vowels. None of these plans came to much, 
however, although in at least one major publication, §emsuddin Sami’s famous 
encyclopaedia, Ramus, from 1901, three diacritics were used to distinguish 
between o, ii and 6 (normally all indicated by a j). During this late period, the 
notion of writing the letters separately resurfaced. A serious attempt to imple¬ 
ment the idea was made by Enver Pasha, the politician and member of the 
ruling triumvirate, between 1913 and 1917; he especially intended to simplify 
the work of military telegraphists, and various texts produced by the Ministry of 
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War were consequently published in the new script (see Fig. 3). 6 None of these 
steps proved to be adequate or even acceptable, though, and the Arabo-Persian 
script was eventually given up altogether (in 1928). 

As regards the various styles of scripts, in what follows, I will only discuss 
the types most commonly found and not indulge in calligraphic finesses and 
the nuances of sub-types, which are often difficult for the non-specialist to 
distinguish. 7 


4 Ottoman scripts: variation in the outward 
appearance of letters and their use 

Along with the Arabic and Persian languages and literatures, the Ottomans inhe¬ 
rited a manuscript culture from their predecessors, the Turkish principalities 
(emirates) in Anatolia, which in part were inheritors of the Seljuks. As far as I can 
tell, the Ottomans adopted the culture almost wholesale and, apart from making 
modifications to some of the inherited types of script and an additional language, 
Turkish, did not alter it or add to it much. 

The most widely used type was, doubtless, naskhl (or nesih in Turkish) (see 
Fig. 4). This was chosen for works of prose, which also encompassed religious 
and scholarly texts including the Qur’an, and for histories, collections of stories 
and other such accounts. Legend has it that the script was invented by Ibn Muqla 
(d. 940). It was the most readable script of all at the time and was meant to furnish 
easy access to texts. Used in conjunction with diacritical marks, mostly employed 
in religious writings meant for a broader readership, it gave little room for error 
or misinterpretation. Its forms hardly changed over the centuries. It was also pre¬ 
dominantly used in printed books, journals and newspapers from the mid-18 th 
century onwards. 

A larger and more artistic variant of nesih was called thuluth ( sillils ). It was 
sometimes used for book or chapter titles or religious formulae, like the bismillah 
in Ottoman manuscripts, but is rarely encountered in archival documents (where 
nesih, divani and rik'a predominate; see below). The script was mostly used for 
large inscriptions in and on public buildings like mosques and endowment libra¬ 
ries and is often found in calligraphic albums as well. 


6 Ibid., 28-29. 

7 In what follows, I rely mostly on Aktan 1995, 31-72. 
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Fig. 4: Two pages from a multipie-text manuscript with text in ornamental nesih, 17 th century. 
Leiden University Library, MS Or. 12.411. © Courtesy of the University Library Leiden. 


A script of Persian origin - tradition has it that it was introduced into the Ottoman 
Empire by Persian calligraphers during the reign of Mehmed the Conqueror - 
also used what was known as ta'liq (literally meaning ‘suspension’ in Arabic 
and called nasta'liq in Persian) (see Fig. 5). This became widely used in Ottoman 
manuscripts containing collections of poetry and other texts in the genre of 
belles lettres. It was also chosen for documents produced by the religious admin¬ 
istration, such as charters of pious endowments and court registers. In its cal¬ 
ligraphic {cell) variant, it is found on signboards and inscriptions on monuments 
and buildings which were part of pious endowments, such as mosques, religious 
schools {medrese s), libraries and fountains; these texts are often in the artistic 
form of poems. The opposite of calligraphic ta'liq is called sikeste {shikastah in 
Persian, literary meaning ‘broken’ and a ‘sloppy’, irregular version of the script), 
which was used for informal letters and notes by private individuals who did not 
want or need to spend time writing particularly neatly. It was not used much in 
the Ottoman Empire where more casual, idiosyncratic versions of rik’a, to which 
I will return later, were preferred. 
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Fig. 5: Opening page of an Ottoman ‘mirror for princes’ with text in ta'hq, 17 th century. Leiden 
University Library, MS Or. 625. © Courtesy of the University Library Leiden. 


There were two scripts which were almost exclusively used in the Ottoman bureau¬ 
cracy. They shared the feature of irregular ligatures, meaning that in principle 
all the letters could be connected. Firstly, there was siyakat (siyaqah in Arabic), 
a defective script (dots are generally left out) expressing a specialised, encoded 
Arabo-Persian terminology used in the financial administration and inherited 
from the Seljuks (see Fig. 6). 
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Fig. 6: A composite manuscript, 16 th century, with marginal notes, partly in siyakat script. 
Leiden University Library, MS Or. 644, fol. 43b. © Courtesy of the University Library Leiden. 


This was used for financial documents produced by various state institutions, 
including land registers, cadastral entries, parts of deeds of pious endowments 
and military records. To complicate matters (and increasing its illegibility for the 
uninitiated), the script changed greatly over time (which was hardly the case with 
the other Ottoman scripts - why this was so is unknown). The script also used 
special numbers consisting of Arabic letters meant to avoid falsification. Siyakat 
was abandoned during the last quarter of the 19 th century. 
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Secondly, there was divani (‘pertaining to the Council of State’, also known 
as diwani), a calligraphic script used in official documents such as sultans’ edicts 
( fermans ), state letters and ministerial registers. This was invented by the Otto¬ 
mans, although its origin is obscure (according to the polyhistor Mustafa Ali (d. 
1600), it was developed from a ta'/i/c-like Persian script); it first appears in the 
15 th century in official documents and reached its full (classic) development in 
the 18 th century. Like siyakat, the script was rarely printed. This may have had 
something to do with its illegibility for the lay reader. (Occasionally, divani was 
used for other genres as well (see Fig. 7); because the Dutch pastor and Orientalist 
Johannes Heyman (d. 1737) had texts copied by the Dutch consular secretary at 
Izmir, who normally used divani, some manuscripts in his library - now kept by 
the Leiden University Library - contain texts written in divani, a number of which 
are literary pieces.) 

Finally, there was a widely used script called rik’a (or ruq'a, originally an 
Arabic term meaning a piece of paper or a brief message, among other things) (see 
Fig. 8). It was basically a simplified and more easily legible divani which had the 
practical advantage that it could be written quickly because of the large number 
of ligatures it contained. It was an Ottoman invention, appearing for the first time 
in 16 th -century records, and eventually developed into a separate script in the 17 th 
century. Rik c a came to full fruition in the 19 th century and was later adopted by 
Arabs and Persians alike. It was adopted widely in the Ottoman bureaucracy and 
became the predominant script for personal use in the late 18 th century, when it 
also began to be used in manuscripts with texts in various genres. It was some¬ 
times, albeit rarely, printed or used for inscriptions on seal rings or medallions, 
for instance. It is perhaps the only Ottoman script still used in Turkey today - by 
very old people who received their elementary education before 1928, which was 
when the Arabo-Persian scripts were abandoned by the government (suppressed, 
in fact) and the Latin alphabet was made compulsory for public use. 
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Fig. 7: Two pages from a multiple-text manuscript compiled byjohannes Heyman with texts in 
divani script, late 17 th century. Leiden University Library, MS Ac. 87. © Courtesy of the University 
Library Leiden. 


5 Conclusion 

So far, a few remarks have been made on the spelling of Turkish and the most 
common types of script found in Ottoman manuscripts, including archival docu¬ 
ments. Little research has been done on aspects of Ottoman Turkish orthography 
as yet and we are far from having a comprehensive study on its history at our 
disposal. This is no surprise in view of the fact that our knowledge of the his¬ 
torical development of the Ottoman Turkish language is still rudimentary (not 
least, paradoxically, because linguists are mostly dependent on texts in a defec¬ 
tive script produced by and for an urban elite). In this paper, I have largely based 
my findings on my own limited experience as a reader of Ottoman manuscripts 
and printed texts. 
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Fig. 8: A letter from Mehmed Hayriiddin in Stuttgart to Christiaan Snouck Hurgronje in Leiden, 
rik’a script, 1910. Leiden University Library, Or. 8952. © Courtesy of the University Library Leiden. 


As for the central theme of this collective volume, standardisation, I would say 
that a tendency towards standardisation is visible in Ottoman texts written 
between 1300 and 1928, with the exception of texts produced outside the main¬ 
stream genres, but it was never actually formalised. Once the Ottoman Empire 
was securely established around the dynastic court in Istanbul in 1453 and early 
modern Turkish had replaced old Turkish around 1500, the spelling of texts - at 
least those serving the literary elite - began to become relatively stable. As far as I 
am aware, there were no religious or - before 1908 - political incentives, let alone 
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a state policy, that contributed to this development. Manuscript production was 
diffuse, that is, it was not concentrated in one or a few workshops, at least outside 
the palace and the state bureaucracy. As far as we can tell from colophons in 
manuscripts, the copying of texts was a thoroughly decentralised and individual 
activity. Publishing as a commercial activity only came with printing in the 19 th 
century and will inevitably have contributed to standardisation even further. The 
formats used for manuscripts did not influence spelling, it seems. 

For most Ottomans, formats and the aesthetic aspect of the script in its various 
styles seem to have been more important than the way in which words were spelt, 
and they continued to develop the shape of the letters and their ligatures in the 
footsteps of the Seljuks before them, in particular. Calligraphy was considered to 
be one of the highest forms of art, if not an esoteric science, compared to which 
mere orthography was insignificant. 
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Branka Ivusic 

Developing Consistency in the Absence of 
Standards - A Manuscript as a Melting- 
Pot of Languages, Religions and Writing 
Systems 

Abstract: This chapter deals with a late 16 th -century multilingual Ottoman 
manuscript. It is a study of the anonymous scribe’s attempt to write European lan¬ 
guages in Arabic script and it tries to answer questions about possible orthogra¬ 
phic models, the transfer of writing conventions from one language to another or 
from one script to another. The paper also looks at the scribe’s own innovations 
and the consistency of the spelling system employed in this manuscript. 


1 Introduction 

1.1 An ‘Oriental’ manuscript and its ‘European’ texts 

In the collection of Oriental manuscripts preserved in the Austrian National 
Library (Osterreichische Nationalbibliothek, ONB) in Vienna we find a rather 
puzzling Ottoman multiple-text manuscript. A small, but not miniature volume 
of 162 folios 1 , written in Arabic naskhl script and apparently by one hand only, it 
contains more than 300 texts in Ottoman Turkish, Persian and Arabic, but also 
in Croatian (a variety of Central Southern Slavonic that might be termed Croa¬ 
tian), Hungarian, German and Latin. The manuscript is assumed to be the work 
of an anonymous compiler, and, judging from its very regular appearance and the 
remarkably low number of corrections, it is likely to be a fair copy. 

The number of languages found in the manuscript creates a hybrid impres¬ 
sion which is only enhanced when we look at the contents. Most of the texts are 


The present study is part of my PhD research at the Centre for the Study of Manuscript Cultures 
(CSMC) in Hamburg supported by the DFG. 


1 Except for the last ten folios, which were left blank, and folio 107, which has only a catch¬ 
word, all folios are covered by writing on both sides. The average page displays one justified 
block of 15 lines and, if it is a verso-page, a catchword in the left lower margin. 


3 Open Access. © 2019 Branka ivusic, published by De Gruyter. [©ElISHI This work is licensed 
under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110639063-007 
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Ottoman Turkish. They range from a language guide to Persian, instructions in 
Islamic faith, astrology and mathematics to love poetry. The few Arabic texts are 
exclusively religious, mostly collections of ahadit while the Persian sample con¬ 
sists only of a few poems along with bits of Persian in the language guide. Such 
texts and this kind of content-specific ‘division of labour’ among the three most 
important languages of the empire are by no means uncommon for an Ottoman 
multiple-text manuscript, a so-called mecmua. The “European” samples on the 
other hand are not only remarkable because they show the use of Arabic script - 
and to some degree Ottoman writing conventions - for languages usually written 
in Latin characters (alongside Cyrillic and Glagolitic, in the case of Croatian) 
but also because of the content of their texts. All four languages appear first in a 
penta-lingual section of the manuscript introduced by a heading and a narrative 
paratext - both in Turkish - stating that what follows are the essentials of Chris¬ 
tian and Jewish beliefs. These essentials turn out to be the Ten Commandments, 
The Lord’s Prayer, and the Apostolic Creed in their Lutheran versions, split into 
smaller parts, which are usually numbered and presented first in Turkish, then 
in Croatian, Hungarian, German, and finally in Latin. The penta-lingual section 
is followed by one without any introductory paratexts comprising songs in all the 
European languages except Latin. With fourteen Christian religious songs, some 
of them composed by Martin Luther and by other prominent German reformers, 
as well as five secular songs, the German sample is by far the largest. Hungarian 
comes second with five Christian religious songs, one love song and one mixed 
text, Ottoman-Turkish with Hungarian insertions, which was also considered a 
love song by most previous researchers but which might be a su/f poem, describ¬ 
ing the longing for knowledge rather than for another person (Sudar 20 05, 75). 
The last text of this section is the only Croatian song in the manuscript. It has also 
traditionally been regarded as a love song - some authors even attributed Petrar- 
cian features to it (Nametak 1981,10) - but in fact it might also be a mystical reli¬ 
gious song. The three texts mentioned last are important for the understanding 
of the whole manuscript because none of them could be identified in any other 
source and they might be works composed by the compiler. The only “European” 
text to be found outside these two sections is a German distich at the very end of 
an anthology of Turkish and Persian distiches. 


1.2 The history of the manuscript 

As mentioned above, the manuscript is among the holdings of the ONB (currently 
under the shelfmark Cod. A.F. 437, Fig. 1). It is not explicitly dated in a colophon 
or chronogram or the like, but folio 2r contains a note in the Latin language and 
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script: ‘Miscellanea varia. Colloquium Persico Turcicum. Professio fidei Turcicae.’ 
[Various miscellanies. Persian-Turkish phrase book. Profession of Turkish faith.], 
a rather understated and incomplete list of contents, but nevertheless useful, as 
it can clearly be attributed to Sebastian Tengnagel (Mittwoch and Mordtmann 
1927, 76) who worked in the Imperial Court Library (later the Austrian National 
Library) from the 1590’s until his death in 1636. Unfortunately, his handwriting 
did not change during this time (Unterkircher 1968, 133) and does not permit 
further dating, therefore the year of his death must serve as a terminus ante quem. 
Later cataloguers claimed that the manuscript once belonged to Tengnagel’s own 
collection which he left to the library upon his death. Tengnagel elaborated two 
catalogues of this collection, one in 1612 (ONB Cod. 9539), the other (ONB Cod. 
12650) at an unknown later date. No manuscript that could be identified with 
the one discussed here is mentioned in the first catalogue, and the second is not 
accessible for reasons of conservation. Thus, at present, it is not possible to deter¬ 
mine whether Tengnagel wrote the note as owner or as custodian. 

Obviously, the librarian, who was very learned and interested in Oriental lan¬ 
guages, was not the first “European” to lay his hands on the manuscript. Someone 
who had no experience with Oriental books made a first foliation, starting at the 
end of the manuscript. The damage to this foliation on nearly every folio shows 
that the manuscript was trimmed, probably at the time it was rebound. 

A terminus post quem is provided by a list of Ottoman Sultans (fols 69v-70r), 
which ends with Murad III, whose reign started in the year 1574 and ended in 
1595. Moreover it was written on watermarked European paper. Curiously, there 
are eleven different watermarks. 2 Two of them are a so-called watermark pair, 
which means that they were produced with two different moulds in the same 
paper mill. All watermarks are located in the fold of the presently rather tightly 
bound codex, hence only a few fragments are visible, so that any identifica¬ 
tion with dated reference examples would be very tentative. Nevertheless, the 
watermark pair appears by shape, size and placement of the chain lines to be 
identical to one used in Vienna in 1587. Despite differences in detail some others 
also show great similarities to watermarks in manuscripts written in Austria 
from 1584 to 1604. 

As with the other conditions of its production, there is no mention in the 
manuscript of its place of origin. The language samples found in the manuscript 


2 Despite this high number of watermarks nothing in the manuscript suggests that it was writ¬ 
ten in more than one production process: Some quires indeed show just one watermark, and 
some watermarks can be found in one quire only, but, on the other hand, there are also quires 
with two watermarks and identical watermarks in different quires all over the manuscript. 
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indicate that it was most likely produced in the Kingdom of Hungary, large parts 
of which were occupied by Ottoman troops by the end of the 16 th century. 

The idea of the manuscript’s having been produced by a European convert 
was proposed very early and is still maintained by some authors (Krstic 2011,88f.; 
Romer 2014). The first cataloguer to describe the manuscript in more than a few 
words, Josef von Hammer, suggested this idea. Moreover he thought the com¬ 
piler was an imprisoned crypto-protestant, who wrote Christian religious texts in 
Arabic characters in order to conceal them from his Muslim guards (Hammer 1812, 
34 No. 281). The compiler’s Hungarian origin was first promoted by Antal Gevay. 
Based on the colophon of the first Hungarian song - a Christian religious captive’s 
lament - he believed he had discovered the compiler’s identity even by name. The 
colophon says: ‘ezer of szaz es hetven / esztendoben <■ csonkatoronnak eros temle- 
ceben / palaszki gyorgy igen bizik istenben <■ bogy nem // hadja sokaig az bilntetes- 
ben’ [In the year one thousand five hundred and seventy, in the strong dungeon 
of Csonka tower, Palaszki Gyorgy indeed trusts in God, so that He doesn’t leave 
him in punishment for long] (fol. 60v, 13 - fol. 61r, 1). Gevay concluded that it was 
Gyorgy Palatics (Gyorgy Palaszki, according to the manuscript) who had compiled 
all the texts and written the manuscript. He even believed he could trace Palat¬ 
ics’ alleged gradual estrangement from his Christian Hungarian origins and native 
tongue in the sequence of texts in the manuscript (Rexa 1901,109). Later, most of 
his assumptions about Gyorgy Palatics turned out to be wrong. The Hungarian 
nobleman was indeed the composer of the song - which can be found in Hungar¬ 
ian hymn books of that time - as well as a captive of the Ottomans for three years, 
but he never converted to Islam, nor did he die in captivity, as Gevay supposed, but 
held important positions thereafter. Although Hungarian researchers dismissed 
Gevay’s idea about Palatics as being the compiler 3 as early as the late 19 th century, 
the manuscript is still referred to as Palatics-kodex in Hungarian publications. 

Later researchers had many intuitively appealing theories about the 
compiler’s origin, religious affiliation and motives, but most of them relied more 
on impressions than on firm evidence. Thus we still know little about the person 
who produced the manuscript. The aim of the present paper is to show how he 
adapted the Arabic script when writing in Croatian, Hungarian, German and 
Latin, and, possibly, to shed some light on his identity; for example, his degree 
of familiarity with writing conventions in other writing systems might tell us 
something about his education. 


3 See Sudar (2006, 9-12) for a more extensive overview of the discussion. 
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2 Arabic script in the European territories of the 
Ottoman Empire 

Islam and Arabic script had been introduced into South Eastern Europe before the 
area was conquered by Ottoman troops and integrated into the Ottoman Empire. 
The process began in the late 14 th century and was at its peak by the middle of the 
16 th century, and only then did Islam and Arabic script have any impact on the 
local population. In the late 16 th century, Ottoman troops held fortresses located 
as far as today’s borders between Hungary, Slovakia and Austria. 


2.1 Writing the languages of the local populations in Arabic 
script 

Although sporadic use of Arabic script is documented for a few languages in the 
region 4 , only Albania and Bosnia, where large parts of the population converted 
to Islam, would develop a real manuscript culture in Arabic script in their respec¬ 
tive local languages. In Bosnia such a manuscript culture can be traced back to 
the 17 th century. First prints in Arabic characters appeared in the 19 th century and 
continued into the 1940s. In the Bosnian research tradition, the phenomenon is 
known as alhamijado knjizevnost ‘aljamiado literature’, a term which probably 
originated in an influential encyclopedia entry (Hadzijahic 1955, 144) where it 
was borrowed from Ibero-Romance studies. Bosnian researchers use the term 
arebica or arabica to refer to the Arabic script when used to write Central South 
Slavonic (CSS) varieties (the term is analogous to cirilica ‘Cyrillic’ and latinica 
‘Latin script’) that were (later) regarded as one language: Serbo-Croatian or Cro- 
ato-Serbian (currently fragmented into the four national standards: Bosnian, 
Croatian, Montenegrin and Serbian). Manuscripts recording such varieties were 
produced in considerably smaller quantities outside of Bosnia and Hercegovina, 
respectively not by Bosnians or Hercegovinians. Perhaps the best known of these 
is one of two manuscripts containing a multilingual phrase book, probably from 
late 15 th century Istanbul, with translations of Arabic sentences into Persian, 
Greek and a Serbian dialect (Aya-Sofya kiitiiphanesi 4750, see Lehfeldt 1989). 


4 For more information on the spread of Arabic script in the Balkans see Zakhos-Papazahariou 
1972 and Hegyi 1979, though e.g. the information they give on the existence of Arabic script ma¬ 
nuscripts in Bulgarian / Macedonian could not be confirmed (Ivusic 2014, 94 n. 25). 
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Unfortunately most of the manuscripts in arebica were - if at all - studied 
only as texts, not as material objects in their own right; reliable editions are scarce. 
The only comparative study of writing conventions based on the manuscripts was 
published in 1969 by the German scholar Werner Lehfeldt (1969), although both 
earlier (Muftic 1964) and more recently (Drkic and Kalajdzija 2010), attempts were 
made to explore their graphematics. 

Despite the fact that some Hungarian converts played important roles in 
Ottoman intellectual life, like the 16 th century interpreter Murad or the 18 th 
century printer ibrahim Miiteferrika, there were never any mass conversions of 
native speakers of Hungarian. Thus Arabic script literacy and proficiency in Hun¬ 
garian has never been a frequent combination. In fact the only known instances 
of Hungarian in Arabic script, except for the manuscript discussed here, are both 
attributed to Murad the interpreter (Murad tercilman), one of which is the only 
other known manuscript containing Latin in Arabic characters; the latter is an 
autograph of Murad’s Islamic religious treatise in Ottoman Turkish and in Latin 
and Hungarian, all three languages being written in Arabic script with an interlin¬ 
ear Latin transcript (MS Marsh 179 in the Bodleian Library). The other consists of 
some marginal notes in the famous Chronicon pictum (Zsinka 1923), a 14 th century 
illuminated chronicle. Pal Acs (2000, 312f.) assumed that these notes were also 
written by Murad’s hand. 

The only other known sample of German in Arabic script consists of a few 
words, idiomatic expressions and parts of a prayer in Evliya Qelebi’s Seyahat- 
name (Kittling 1938; Romer 2009). 


3 Adapting Arabic script for the European samples 
in the manuscript 

Phonologically all four European languages (Croatian, Hungarian, German and 
Latin) differ substantially from Arabic. Whoever recorded them in the manuscript 
used the Persian and Ottoman Turkish varieties of writing, and did not have to 
adapt the Arabic script as used for Arabic. Thus he had additional graphemes at 
his disposal, and was familiar with the writing habits and conventions of Persian 
and Turkish. Nevertheless the writing system was inadequate for representing 
any of these European languages and our scribe was confronted with many prob¬ 
lems especially in vowel transcription. 

Evidently, the samples of the European languages in the manuscript are 
not mere transcriptions of a Latin script, or in the case of the Croatian sample, a 
Glagolitic or Cyrillic model. Each of them shows features absent in the non-Arabic 
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script written sources of that time. One such example is the use of zayn and sin 
for a regular differentiation of voiced [z] and voiceless [s] in German (Ger.) and 
Latin (Lat.), in examples like Ger. o^j' <’w?uns°> [?uns] ‘us’ vs. j>jl <’wun°zar 0 > 
[?unze] ‘our’ (fol. 34r, 6) or Lat. ^^jjj »j <rah zuw?r 2 aq°syit°> [rezu’r:eTsit] ‘he 
arose’ (fol. 37v, 15). This is a sub-phonemic distinction never found in the Latin 
script, in either manuscripts or in prints of the early modern period. Neverthe¬ 
less, the scribe’s attempts at representing spoken language does not mean that 
he based his notation solely on the phonetic principle; some morphological spell¬ 
ings are also found (cf. Section 3.3.2). 


3.1 Consonants 

Most of the consonant phonemes of these European languages are a subset of 
the Arabic phoneme inventory, hence their graphical representation was no chal¬ 
lenge. The same holds for phonemes which are absent from Arabic but present 
in Persian and Turkish, and for which graphemes already existed. Nevertheless, 
each of the European languages has some consonants not found in the three 
Oriental languages. Table 1 shows what solutions the scribe found for their 
graphic representation. All the characters he used when writing consonants in 
the European samples are listed. Since, on the one hand, some of the encoded 
sounds are not phonemes, such as [dj] in German, and on the other hand ^ <k> 
for instance cannot be regarded as a grapheme, not even in Ottoman Turkish, 
if we define grapheme as the smallest distinctive unit of written language, 
the two terms are not used in the table, although most of the character-sound 
correspondences displayed are also grapheme-phoneme correspondences. 
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Table 1: Consonants in the European samples 


Character 

Cro. sound 

Hun. sound 

Ger. sound 

Lat. sound 

m <b> 5 

/b/ 

hi 

hi 

hi 

y cbp 

hi 

hi 

hi 

hi 

cj<t> 

Itl 

hi 

hi 

hi 

e<g> 

■ 

■ 

[d5l 6 

■ 

5<g 1 > 

hi, /ts/ 

hi, /ts/ 

/ts / 

hsu /ts/? 

c <h> 

/X/ 

hi 

hi, /x/=[x]+h] 

hi 

c < b > 

/X/ 

- 

/x/=[x]+h], hi 

hi 

J<d> 

Id/ 

hi 

Id/ 

hi 

J<r> 

hi 

hi 

hi 

hi 

j<z> 

hi 

hi 

hi 

[z] 

^ <Z 2> 

III 

III 

- 

[ 3 ] 

o*<s> 

Isl 

hi 

hi 

hi 

u^<s> 

hi 

hi 

hi 

HI 

D- a <S> 

- 

hi 

- 

- 

-la <t> 

hi. Id/ 

hi 

hi. Id/ 

- 

i<8> 

Isl 

Isl 

Isl 

Isl 

<f> 

hi 

hi 

hi 

hi 

l 3 <q> 

/k/ 

hi 

hi 

hi 

^<k> 

/te/, Ihl, Ipl 

hi, hi, hi 

■ 

[tfeRltl? 

^l<k 3 > 

■ 

hi, hi 

■ 

- 

J<1> 

hi 

hi 

hi 

hi 

f <m> 

/m/ 

hi 

hi 

h/ 

u<n> 

hi 

hi 

hi 

hi 

*<h> 

([h])? 

hi 

hi, hi hi 

- 

if<y> 

I]l, 1+pal) 

Dh [+pal] 

l\l 

Ill 

J<W> 

hi 

hi 

(/v/) 

hi 


Even a cursory glance at the table reveals some important aspects of the scribe’s 
adaption strategy: 1. Although the Arabic script is especially suited for adding 
characters through alternations of the number and position of dots - a method 
frequently employed when adapting it for other languages (Daniels 2014) - the 
scribe did not create any additional characters. 2. Arabic characters encoding 
sounds that are not present in European languages or which are not regularly 
used as a substitute, such as £ <g> for /g/, were not used. The scribe omitted the 


5 The transliteration follows the proposal of Mumin and Versteegh 2014,11-21. 

6 The grapheme jr <g> in the German sample appears in wordforms where the sound [d;] was re¬ 
alised due to regressive assimilation. There is no evidence for [d;] being phonemic in the sample, 
hence it is put in phonetic brackets in the table. 
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graphemes Gj <t>, i <d>, <d>, <d>, £ < c > for /0/, /5/, /dV, /3'7, A/ rather than 

linking them arbitrarily to sounds of the European samples regardless of phonet¬ 
ic proximity. Instead he chose the graphic representation of the Ottoman Turkish 
sound, which he perceived as phonetically closest even if that created new ambi¬ 
guities. A closer look at some cases in which there is no obvious one-to-one char¬ 
acter-sound correspondence will clarify further aspects of the adaption strategy. 


3 . 1.1 £< 1 ^ 

The use of £ <g> for the affricate /ts/ is a good example of how a sound in the 
European languages is represented by the character encoding its closest match 
in Ottoman Turkish. The consonant £ <g > is not an original character of the 
Arabic script, it is a Persian innovation adopted in Ottoman Turkish writing. Both 
Persian and Turkish have a voiceless post-alveolar affricate /tf/, but lack an alve¬ 
olar affricate /ts/. In Central South Slavonic, Hungarian and German both sounds 
are phonemic, while they are absent from Classical Latin. The Latin found in the 
sample contained in the manuscript is of course not Classical Latin, neither is it 
scholarly Neo-Latin, although the latter had some influence on the variety that 
clearly shows features of Medieval Latin. Therefore, it is unlikely that the £ <g> 
in words like G*-jj <tar°g 1 iy’a>, cf. Class. Lat. tertia ‘third’, represents a dental or 
alveolar stop /t/, or a velar stop /k/ in a word like PPp < if: 'n 0 g | yiP i am°>, cf. Class. 
Lat. ancillam ‘maidservant (acc.)’. It is more probable that £ <g> represents /ts/ 
in Gf > <tar°g 1 iy’a>, and /tf/ in in <’ 5 n 0 g 1 yil 2, am°>. When £ <g > is written 
where Class. Lat. ti would appear, there is no reason to assume any other sound 
but /ts/, as the assibilation from /ti/_V to /ts/ was a sound change already doc¬ 
umented in 2 nd or 3 rd century Latin; it was well established in Medieval Latin, 
and was believed to have been classical even by humanists (Stotz 1996, 219). The 
interpretation of those cases where £ <g> encodes a sound corresponding to 
Class. Lat. /k/_/i,e/ is much harder to justify. What can safely be ruled out is that 
£ <gp represents the velar /k/, as all attestations of /k/ followed by a back vowel 
or a consonant are encoded by J <q> (see Section 3.1.2). In Romance languages, 
except Sardinian and Dalmatian, /k/ before front vowels was palatalised to /ts/ 
or /tf/ (a change which often continued to develop), and in Medieval Latin, it was 
also pronounced either as an affricate or as a sibilant (Stotz 1996,183-185; Bonioli 
1962, 73-78). The velar pronunciation was retained in Ireland, and Irish influ¬ 
enced parts of northern England well into the 12 th century. As the character £ <g > 
is used to represent both sounds, /ts/ and /tf/, in the other language samples, and 
both pronunciations existed in Medieval Latin as well, we cannot decide which 
one to assume for the Latin variety. 
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3.1.2 <jj<q>, ^<k> in Croatian, Hungarian, German and Latin 

The employment of the graphemes qaf3 <q> and kaf ^ <k> shows that, occasion¬ 
ally, a shared prominent distinctive feature like palatal articulation was enough 
to establish phonetic proximity for the scribe. Moreover it shows that he aimed 
at a unified transcription system for all four European languages. Like Ottoman 
Turkish, all of them have a voiceless velar stop /k/. In Ottoman Turkish both j 
<q> and ^ <k> are used to write /k/. As with the graphemes for the other pairs of 
emphatic and non-emphatic sounds in the Arabic source orthography, <i <q> and 
ul <k> serve to encode vowels in Ottoman Turkish: j <q> is used in inherited words 
with back vowels as in lcurak ‘dry’ written as j' jA <quraq>, and ^ <k> for those 
with front vowels as in kiirek ‘oar’ written as ul <kurk> (Weil 1917, 24; Bugday 
1999, 13-15). As Ottoman Turkish is a language with vowel harmony based on 
the contrast front/back (front/not-front), such a system of vowel notation is quite 
effective. In the European samples, /k/ is represented by qaf only, both with back 
and front vowels, as can be seen in the examples in Table 2. 


Table 2: <q> in the European samples 



Croatian 

Hungarian 

German 

Latin 

/k/ before back 

A 

I'jLAs) 



vowels 

<qwu> 

<b 1 uwquwl°r a ’a> 

<qwum°> 

<qwun°tr’ a > 


ko ‘who’ 

pokolra ‘to hell’ 

komm ‘come’ 

contra ‘against’ 

/k/ before and 



jibs 


after front vowels 

<wah qah> 

<qiywuwlam°> 

<qyin°dar°> 

<naq°> 


veke ‘age’ 

kivulem ‘besides me’ 

kinder ‘children’ 

nec ‘neither’ 


Kaf on the other hand, is used for different palatal sounds in Croatian and Hun¬ 
garian, and very likely also in Latin. In the Croatian sample, ul <k> represents the 
voiceless palatal affricate /te/ (<c> in modern orthography) as in ^ <wah kah> 
vece ‘bigger’, but also the voiced /ck/ (<d>) as in <sykah> side ‘he descended’ 
and the nasal /ji/ (<nj>) as in <kaguwwuw> njegovo ‘his’. 

In Ottoman Turkish, kaf also had various functions. It was employed for 
the stops /k/ and /g/, for the approximant /iq/, and for the velar nasal /q/. In 
the Western Rumelian dialects (a group of present-day Turkish dialects in the 
Balkans), /k/ and /g/ before front vowels are not only articulated in pre-ve¬ 
lar position, as in Standard Turkish (Stand. Trk.), but have been palatalised to 
palato-velars, palatal affricates or palatal stops (Friedman 2002, 613). We cannot 
say when this process took place, but already in early Turkish loanwords in 
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Serbo-Croatian, or loanwords transmitted via Turkish, /k/ and /g/ before front 
vowels are frequently, although not regularly (Nemeth 1970, 91), substituted by 
/te/ and /d?/ respectively. Famous examples are cuprija ‘bridge’, cf. Stand. Trk. 
koprii (Skaljic 1966,200) or don ‘sole of a shoe’, cf. Stand. Trk. gon ‘worked leather’ 
(Skaljic 1966, 252). In Bosnian alhamijado manuscripts, kaf is regularly used for 
/te/ and frequently for /d?/ as well (Lehfeldt 1968,141-154). When Ottoman clerks 
wrote down Serbo-Croatian names in registers and other administrative docu¬ 
ments, they rendered /te/ by kaf (Alicic 2000, xxiii). In such registers kaf can occa¬ 
sionally represent the palatal nasal /ji/ and even an alveolar nasal /n/. 

In the case of /te/, there might also be another reason for using kaf All writing 
systems employed to write CSS varieties in early modern times had difficulties 
representing this sound. In Latin script manuscripts and prints of the 16 th century, 
various solutions, usually based on Italian orthography, can be observed. Typi¬ 
cally <ch> is used, but <c> and <cch> occur as well, and none of them is satisfy¬ 
ing as they often simultaneously represent the affricates /tf/ and /is/. The two 
Slavonic scripts Glagolitic and Cyrillic were created to put Old Church Slavonic 
(OCS) into writing. Despite being closely related to CSS varieties, OCS has a rather 
different phoneme inventory. Since /te/ is not part of it, a grapheme encoding the 
sound does not exist in either of the Slavonic scripts. In both, the sound /te/ is 
usually represented by a character which was used for the OCS sound sequence 
/Jt/, since OCS /ft/ and CSS /te/ are corresponding reflexes of the same Proto-Sla¬ 
vonic sound. But as Trunte (2012, 186) points out, in Cyrillic manuscripts the 
grapheme k <k> for the velar stop /k/ is frequently used for /te/. Moreover he 
found such a spelling, if only in one instance, in a Glagolitic print, a historiogra¬ 
phy published in 1531 in Rijeka (Ital. Fiume). It cannot be excluded that whoever 
wrote our manuscript, knew about these spellings in the Slavonic scripts. 

Encoding the nasal /ji/ by kaf is almost unparalleled in Arabic script CSS. 
With one exception, and in only one instance, Bosnian alhamijado manuscripts 
do not have this spelling (Lehfeldt 1969,156). In the Serbian sample of the men¬ 
tioned multilingual phrase-book, /ji/ is represented by a Digraph 4 <ny>, some¬ 
times adding hamza above ya (Lehfeldt 1989, 49). The rendering might be moti¬ 
vated by the Ottoman Turkish use of kaf for a nasal, although in Ottoman Turkish 
it is a velar nasal, and not palatal. But as we have seen so far, in the scribe’s 
reanalysis of the character it was disassociated from the feature ‘velar’ and linked 
to the feature ‘palatal’. Initially, this reanalysis was probably triggered by the 
phonetics of the sounds which corresponded to /k/ and /g/ before front vowels 
in Standard Turkish. But then the scribe took a further step: the nasal encoded 
by kaf in the Ottoman Turkish variety which served him as a source language 
for the script might not have been velar anymore but was certainly not palatal 
either, so when the scribe chose to encode /ji/ by kaf he picked the most logical 
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representation in the systematics of the writing system he devised, regardless of 
a phonetic justification, and thereby he emancipated the graphic representation 
from phonology. 


3.1.2.1 Three-dotted kaf& <k 3 > and kaf ^ <k> 

In Ottoman manuscripts the alternative for encoding the velar nasal is the 
three-dotted kaf& <k 3 >. This character is also used in the manuscript for the palatal 
nasal, but only in the Hungarian sample, not in the Croatian one. In Hungarian, 
the three-dotted kaf encodes /ji/ as in d£ji <’ s r°k 3 aq> [‘a:rjie:k] arnyek ‘shadow’, 
but also the voiced palatal stop or affricate 7 /}/ as in ^ <nalak 3 °> [‘nele:j] ne legy 
‘don’t be’. Both Hungarian sounds can be represented by kaf without dots as well, 
as in <kyitwum°> [‘jritom] nyitom ‘I open’ and ^ <n’ak°> [rmf] nagy ‘big’. Kaf 
without dots is also used for the voiceless palatal stop or affricate /c/ in tel <‘ a k’ a > 
[‘dcd] atya ‘father’. But the spelling of Hungarian palatals is even more variable, 
because all of them can be represented by digraphs with ya. Table 3 summarises 
the various graphic representations of Hungarian palatals, listing them in order 
of frequency. 


Table 3: Graphic representations of Hungarian palatals 


/c/ ^<ty>, ^<k> 

Itl ^ <k>, <k 3 >, <ky>, <k 3 y>, <dy> 

/ji/ ttl<k>, ^ <k 3 >, yH <ny> 


There are two possible explanations for these spelling variations and, perhaps, 
both have played a part. Firstly, they might have been motivated by phonetic fea¬ 
tures of the Hungarian sounds. Unlike the Hungarian /c/ and /)/, Central South 
Slavonic /te/ and /ck/ are always realised as affricates, not as palatal stops. Palatal 
stops cannot be distinguished from palatalised velars regarding their distinctive 
features (Hall 1997, 70-76). If Hungarian /c/ and /}/ are identical to /k 1 / and /g 1 /, it 
is not surprising that the scribe used ya ’ as a marker of palatalisation to represent 


7 There is an ongoing debate in Hungarian phonological theory as to whether /}/ and /c/ should 
be classified as stops or affricates at the phonological level (Szende 1992,119). Although it is not 
disputed that they are realised as affricates in certain contexts - and as stops in others - they 
pattern with stops in two important respects: First they can be realised by their unreleased allo- 
phones before stops, and second when they appear on both sides of a word boundary, they are 
merged into a geminate (Siptar and Torkenczy 2000, 83). 
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them. Central South Slavonic (CSS) /te/ and /d?/, on the other hand, differ from / 
kV and /g'/ as they are [+strident]. The sounds /te/, /dj/, /c/ and /}/ are palatal but 
only the latter two show realisations which are not distinguishable from palatal¬ 
ised sounds. The presence of the digraph spellings with ya for /c/ and /}/, and 
the absence of that spelling for /te/ and /d?/ might reflect the phonetic distance 
between these two sets of sounds. Secondly, the writing conventions of the Hun¬ 
garian Latin script might have been a model for the digraph writings. As can be 
seen in the Hungarian examples quoted, digraphs are used in modern Hungarian 
orthography to represent each of the three sounds: <ty> encodes /c/, <gy> /}/ and 
<ny> /ji/; these conventions were already well established in the late 16 th century 
(Kniezsa 1959, 17f.). What remains unclear is why the three-dotted kaf does not 
appear in Central South Slavonic (CSS) at least as an allograph for /ji/. Today /ji/ 
is basically the same sound in all known varieties of Hungarian and Central South 
Slavonic but this may not have been the case when the manuscript was written. 

The Latin sample shows only one instance of kaf, in ‘GS/yj <viyr°kinah> 
virgine ‘virgin (abl.)’. This is also the only attestation of what would be /g/ fol¬ 
lowed by a front vowel in Classical Latin. So instead of assuming that kaf rep¬ 
resents a velar /g/, rendered by g ayn in all other attestations of the sound in the 
European samples, it seems much more likely that kaf represents a palatal, close 
to 1)1 or /d?/. That the sound in question might have been /dj/, the pronunciation 
of Latin /g/ followed by i or e in Italy (Bonioli 1962, 81f.), also seems unlikely. In 
that case the scribe would have used £ <g>, as he did when rendering the sound 
in Arabic, Turkish, Persian and German, where it is not phonemic but a product 
of assimilation. 

German lacks this kind of palatal phoneme. Palato-velars, palatal affricates, 
stops and nasals, the domains of kaf in the other European samples, are all 
absent from the German phoneme inventory, consequently kaf does not occur in 
the German sample at all. We may assume that the scribe did not want to spoil the 
disambiguation he had devised for the other languages. 


3.1.3 C <h>, t <b> and 8 <h> 

One case seems to contradict the idea that the scribe omitted Arabic graphemes 
representing sounds that are not part of the phoneme inventory of the European 
languages or their closest match: this is the use of Arabic characters for velar, 
pharyngeal and glottal fricatives. Although none of the European languages has a 
pharyngeal fricative /h/, all of the European language samples show the use of £ 
<h>, the character which represents the sound in Arabic orthography. The expla¬ 
nation of this seeming exception lies in the use of such characters in Ottoman 
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Turkish, which served as a transmitter for the Arabic writing system. In Ottoman 
Turkish only the glottal fricative /h/ was phonemic, and yet the Arabic frica¬ 
tive graphemes which encode the velar and pharyngeal fricatives were used in 
Ottoman writing conventions. Their use was almost exclusively limited to words 
of Arabic origin but they are occasionally found in Persian and Turkish words. In 
most instances, however, the pronunciation was simply [h]. 


3.1.3.1 Croatian 

In the Croatian sample £ <h> is used alongside £ <h> to represent the velar fricative 
/x/, but there seems to be a positional distribution: £ <h> represents the sound in 
syllable onset only, as in <hyiz 2 uw?> hizu ‘house (acc.)’, £ <h> on the other 
hand is used in syllable coda (with one exception), as in jyjjj <z 2 yiw°yih > zivih 
‘the living (gen.pl.)’. The only occurrence of • <h> as a consonant spelling in the 
Croatian sample is in the interjection <’ 5 h°> ah ‘oh’, and here it is impossible 
to tell whether the encoded sound was velar or glottal. Curiously, in Bosnian 
aljamiado manuscripts, £ <h>, the original Arabic grapheme encoding /x/, is 
exceedingly rare and does not appear in the observed distribution (Lehfeldt 1969, 
166-172). In the Serbian parts of the phrase book,» <h> is written for /x/ in all 
positions, £ <h> and £ <h> appear only in one instance each, both in Oriental 
names: <huwarazmis°qy> (Aya-Sofya kiitiiphanesi 4750, fol. llr) ‘Chor- 

asmian’ and <hasan°> (Aya-Sofya kiitiiphanesi 4750, fol. 14v) ‘Hasan’. These 
examples of CSS in Arabic script seem to show that the transmission of the script 
via Turkish, as suggested above, obscured the function of £ <h> in Arabic, so 
that the character could not simply be adopted for writing CSS /x/. Apparently 
the character ^ <h> became the default spelling for post-palatal fricatives in CSS 
written in Arabic script. 


3.1.3.2 Hungarian and Latin 

Hungarian has only one post-palatal fricative, traditionally assumed to be /h/, 
but in syllable final position the realisation is either [x] or zero. 8 However, in syl¬ 
lable initial position, the Hungarian sample shows 132 instances in which £ <h> is 
used, as in cj! jU. <ha’za’t> hazat ‘his house (acc.)’, and there are four attestations 


8 Siptar (1994, 213 and 265-268) still assumes /h/ to be the basic underlying phoneme of which 
the other realisations are derived. Siptar and Torkenczy (2000, 274-277) later stated that, at least 
in today’s pronunciation of educated non-language-conservative speakers from the Budapest 
area (Educated Colloquial Hungarian), /x/ is the basic phoneme while [h] and zero have to be 
regarded as derived allophones. 
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of o <h> (cf. Section 3.3.1). In the Latin sample the glottal fricative is also encoded 
by £ <h>; the only use of £ <h> appears in dw <mah’abaryis°> mechaberis ‘you 

will commit adultery’ and it reflects a velar fricative in the Greek loanword. 


3.1.3.3 German 

In German the situation is more complicated. The consonant phonemes of German 
include a glottal fricative and a velar fricative with two complementary variants: 
velar fricative [x] after back vowels and palatal fricative [cl after front vowels and 
consonants. As can be seen in Table 1, £ <h> is used for all three sounds; nev¬ 
ertheless, there is a definite preference. Of the 283 attestations of £ <h> in the 
German sample only four represent the palatal fricative [g], 22 represent the velar 
fricative [x] but 19 of the latter are instances of the same lexeme. Seven are special 
cases, because they represent various spelling pronunciations (cf. Section 3.3.3). 
The remaining 250 instances of £ <h> indicate the glottal fricative. But the same 
sound is frequently encoded by» <h>. However, the use of» <h> seems to depend 
on position: 81 of 89 attestations of » <h> (when used as a consonant character) 
in the German sample are syllable initial. Moreover, 53 of these spellings occur in 
words with the sound sequence her, most of them being forms or derivations of 
German Herr ‘lord’. This spelling might have been influenced by the spelling of 
the frequent Ottoman Turkish > her ‘every’. The other instances of the character 
»<h> twice represent a syllable final fricative in the interjections oh and ah, which 
might be glottal as well as velar; in a further two examples,»<h> reflects spelling 
pronunciations (cf. Section 3.3.3) and there are four instances of the palatal frica¬ 
tive, but the latter is always found in derivations of machtig ‘mighty’. 

The default choice for representing both the velar and the palatal fricative 
was £ <h>. Only eight out of 377 attestations render the glottal fricative and all 
eight are either instances of the word £ <huwh°> hoch ‘high, up’ or compounds 
with hoch, where the representation of the first fricative was probably influenced 
by the last. 


3.2 Vowels 

All European languages in the manuscript have more vowel phonemes than 
Arabic. The German and Hungarian vocalic systems are especially rich and, like 
Ottoman Turkish, they include rounded front vowels. Vowel quantity is phonemic 
in all of these European languages, and accent is also phonemic in both Central 
South Slavonic (pitch accent) and in German (stress accent), but neither quantity 
nor accent are encoded in the European samples. In each of them, the graphemes 
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representing long vowels in Arabic, ’alif ' <’>, ya ls <y>, and wdw j <w> as well 
as the diacritics representing short vowels , fatha <a>, kasra <i>, damma <u>, 
are used regardless of quantity or of any feature other than vowel quality; this 
mirrors Ottoman Turkish writing habits. The same holds for the different repre¬ 
sentations of a vowel sound depending on its position in the word. It is impossi¬ 
ble to display all the manifold vowel representations in all the languages (some of 
the polygraphic combinations occur in one instance only), but Table 4 (cf. Appen¬ 
dix II) shows the most frequent representations for each of them. 

Some of the assumed vowel phonemes in Table 4 are questionable. For 
instance the diphthong /ie/ is the most plausible interpretation of the graphic 
sequence cs.<iya> in Croatian jig <tiyalwu> tijelo ‘body’, but this would be the only 
attestation in which Proto Slavonic *e (the sound known as jat’) is reflected as /m/. 


3.2.1 Implementing a marker of distinction 

There is no simple one-to-one correspondence between graphemes and vowel 
phonemes. Nevertheless /i/ (/i:/, /y/) can be distinguished from any other vowel. 
In most cases it is also possible to distinguish between renderings of /a/ (/a:/, 
/d/) and those of /e/ (/e:/, /e/, /e:/, /a/). However, the graphic representation of 
rounded vowels is ambiguous in all the European languages. In the first com¬ 
mandment, which is also the first instance of the European languages in the 
manuscript, the Croatian, German and Hungarian texts contain three different 
rounded vowels, as shown in (1). 


(1) Cro. y’a s’am 0 

twuy 0 

buwg 0 

d’atyi 


Ja sam 

tvoj 

bog 

da=ti 


I am 

your 

god 

comp=2.sg.dat 

nabwudah 


wah kah 

buwz 2 yi 

ne=bud-e 


vec-e 

1 

boz-i 

NEG=be.PRS.PFV-3SG 

greater-ACC.PL 

god-? 

wyisah 


’wud° 

manah 


vise (vis-e) 


od 

mene 


more (greater-ACC.PL) 

from 

me 



‘I am thy God, do not have greater and more/higher gods than me’. 
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’an 0 1 w’dcwuq 0 

>5 Z ° 

tah 

’wur’ad 0 

’yis°tanad° 

En 1 vagy-ok 

az 

te 

ur-ad 

isten-ed 

I be.PRS -Isg 

DET 

you 

lord-POSS.SG.-2SGgod-POSS.SG.2SG 

nalakanaq 0 1 

’an 0 

qyiwuwlam 0 

’iydah gan° ’yis°tanyid° 

ne=legyenek / 

en 

kiviil- 

em 

idegen isten-id 


NEG=be.iMP.3PL I except- poss.sg.1sg foreign god- poss.pl.2sg 

‘I am thy Lord, thy God, thou shalt not have foreign gods except me’. 

Ger. dwu suwl°t nyit 0 ’ 5 n°dah rah gat 2 ar° naban 0 | my^r 
Du sol-t nit ander-e gett-er neben \ mir 

you shall-2SG.PRSNEG other-ACC.PL god/PL-ACC.PL beside me 
h’aban 
haben 
have. inf 

‘Thou shalt not have other gods beside me’. 

All rounded vowels are represented by the combination of waw j <w> and damma ' 
<u> (in word initial position preceded by ’a/if' <’>). But already in the second com¬ 
mandment, the first attempts at disambiguation can be observed: a new combi¬ 
nation involving hamza <?> is introduced. The distinctive function of hamza is 
established most consistently in Latin: all instances of back mid /o/ and /o:/ show 
waw+damma j <wu>, while waw+hamza+damma j <w?u> renders back high 
(close) /u/ and /u:/ only. In the second commandment in Croatian j <w?u> occurs 
once representing a word final /u/ while the /u/ in mid-position in the same word 
is again written without hamza. In the Hungarian and German texts of the second 
commandment j <wu> remains the graphic representation of both back vowels, 
but hamza is used twice in the German text, representing /y:/ and /y/. The fourth 
commandment shows the first use of hamza in Hungarian; as in German it was, 
originally, not used to differentiate the back vowels but to mark the rounded front 
vowels /yl and /y:/ (or /p:/) 9 . Since the German sample is mainly Central Bavar¬ 
ian (Bichlmeier and Ivusic 2013), and because one feature of Central Bavarian is 
the delabialisation of Middle High German rounded front vowels, these vowels are 
present in a few instances only and hamza develops the same function of marking 
back high (close) vowels in German as in Latin and Croatian. The further the scribe 


9 The spelling djjLjji <’w?ud°nab 1 0 rw?ul 0 > can be interpreted as udneprol or as udneprill ‘of 
the holiday’. The allomorph of the Modern Standard Hungarian delative ending, -rol is -rill in 
various dialects; in late 16 th century written Hungarian, both variants were used (Papp 1961, 44f. 
and 168). 
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proceeded in writing the manuscript, the more regular the use of hamza became. It 
spread to words which were written only with j in earlier attestations in the man¬ 
uscript, and then to all positions in the word. In one of the songs following the 
penta-lingual section (fol. 42v, 10, Fig. 1) hamza has been added to represent the 
/u:/ in the German word jy <nw?un°> nun ‘now’. 10 This gradual increase of con¬ 
sistency in the use of hamza can be observed in the Hungarian sample as well, but 
here it never becomes associated with one vowel only, rather it is used for both /u/ 
and /u:/ and for the rounded front vowels /p/, /p:/, /y/ and /y:/ as well as for /o/ 
and /o:/. Interestingly, no development in disambiguating back mid and back high 
vowels takes place in the Latin sample. Thus we find two distinct spellings in the 
first instances of the respective phonemes. As already mentioned, the manuscript is 
assumed to be a fair copy. It is possible that the first version of the Latin sample was 
written somewhat later than the samples of the other European languages, when 
the hamza spelling for u was already fixed. 


3.3 Peculiarities of the graphic representation 

3.3.1 Use of characters for Arabic emphatic consonants 

As mentioned, no systematic use of the characters for Arabic emphatic conso¬ 
nants in vowel notation, as it existed in Ottoman Turkish, can be observed in 
any of the European samples. Nevertheless, a few traces of that use are present 
in each of the samples except for Latin. Croatian, Hungarian and German show 
some (Cro. 4, Hun. 3, Ger. 8) instances of the character -L <t> representing /t/ and 
/d/ (only in Croatian and Hungarian) in words with back vowels. If the words are 
not hapax legomena, as three of the Croatian word forms are, they are attested 
with <t> and <d> spellings as well. In the case of Hungarian, adopting the Ottoman 
writing habit would have been possible without much effort since Hungarian has 
a type of vowel harmony close to Ottoman Turkish (front/back), and indeed the 
Hungarian sample shows more instances of the emphatic consonants compared 
to the other European samples. Thus, (a) it is the only European sample using o- 3 


10 This spelling can be identified as a correction because the hamza is added in red ink. Red ink 
is used in the manuscript for certain elements as verse separators, ciphers (if part of an enumera¬ 
tion) and headings. In very rare cases diacritic vowels, dots and other script elements are written 
in red. Sometimes this was simply done for ornamental reasons, in which case, all diacritics in 
a verse would be red; more often, one of them was simply forgotten and added later, when the 
scribe had already switched to red ink, as is the case in the example cited. 
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Fig. 1: Cod. A.F. 437, fol 42v. © Osterreichische Nationalbibliothek Wien. 



166 


Branka Ivusic 


<s> in three attestations, all in words with back vowels; (b) there are four word- 
forms which deviate from the usual encoding of fricative /h/ with £ <h>, using» 
<h> instead. All four words contain front vowels, three are instances of the palatal 
allomorph of the marker of situational possibility -het (denoting a meaning close 
to ‘can’), which also appears with the usual £ <h> spelling, but these three attes¬ 
tations with o <h> are part of an enumeration, in which they stand in contrast with 
word-forms containing the velar allomorph of the marker -hat written with £ <h>, 
as illustrated below (inverted commas are verse separators). 

(2) sah ’aPhatam 0 sah h’al 0 ha’tuwm°‘ sah’uw?P|hatam° sah 
se el-het-emse hal-hat-om« seiil|-het-em se 
neither live-can-lSG neither die-can-lSG neither sit-can-lSG neither 
y’ar°h > atwum 0 ‘ sah ’ahatam 0 sah ’yih’atuwm 0 ‘ | 

jar-hat-om <■ se ehet-em se ihat-om 11 ‘ 

walk-can-lSG neither eat.can-lSG neither drink.can.-lSG 

‘Neither can I live nor can I die, neither can I sit nor can I walk, neither 
can I eat nor can I drink’. 

And finally, there is one exception to the general rule that the velar /k/ is repre¬ 
sented by l 3 <q> in the European samples (cf. Section 3.1.2): In the mixed Hungar¬ 
ian/Turkish text, the final consonant of the Hungarian lexeme lelek ‘soul’ is ren¬ 
dered by ^ <k> in all nine attestations. In other texts of the Hungarian sample, the 
same word is written with the usual <i <q>. It might be that in this mostly Turkish 
text Turkish writing habits spread to the Hungarian parts as well. However, it is 
not clear why only instances of one lexeme are affected, since the three other 
words in the text which have /k/ in the front vowel environment (kegyelmed ‘Your 
Grace’, kegyelmes ‘gracious’ and szeretomnek ‘to my beloved’) show the j <q> 
representation. 


3.3.2 Morphological spellings 

As has been argued so far, the notation of the European samples in the manuscript 
aims at representing spoken language and is not a transcript of a model in another 


11 The initial /i/ of the verb inni ‘to drink’ is palatal in Modern Hungarian but it developed 
from a velar vowel /ui/ which merged with palatal /i/ in the first centuries of the Old Hungarian 
period, 896-1526, (Abaffy 2003, 320f.). Reflexes of the original velar quality can still be seen in 
Modern Hungarian, as the verb demands the velar allomorphs of endings and suffixes. 
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writing system. Nevertheless, the phonetic principle was not the only one emplo¬ 
yed in the writing of the European languages. 

In Hungarian, the palatal stops or affricates /}/ and /c/ (already discussed) 
are often created in a morphophonemic process when roots and stems ending in 
/d/ and /t/ merge with suffixes and endings with initial /j/. In the manuscript the 
resulting word-forms are usually written in a way that represents their morpho¬ 
logical structure at the expense of their phonetics, as can be seen in examples like 
3" <wyg’dyw?nq> [‘vigD/afunk] vigadjunk ‘we shall celebrate’ ( vigad + l.pl. 
prs.imp.indf.). 

In the German sample, final devoicing, a very prominent phonological 
process in most dialects since the Middle High German period and in Modern 
Standard German, is not represented graphically. The devoicing affects stops and 
fricatives in syllable final position, thereby obscuring paradigmatic relations, as 
in Tag [ta:k] ‘day’ vs. Tage [‘ta:ga] ‘days’. As in Modern German orthography, the 
graphic representation in the manuscript does not reflect devoicing. In the case 
of the manuscript, one might rather speak of hardening, since the consonants in 
question do not differ in voice but stand in a fortis-lenis opposition in Bavarian, 
but the same character is employed for both fortis and lenis as in ^ <t’ag°> Tag 
and in <t’agah> Tage. 

Both instances of morphological spellings were writing habits which, in the 
late 16 th century, were developing into conventions in the Latin script orthogra¬ 
phies of the respective languages; but since both reflect the analysis of rather 
transparent morphological processes, there is no need to assume an influence of 
Latin script writing. 


3.3.3 Latin script convention or spelling pronunciation? 

There is one further phenomenon in the German sample that, at first sight, is 
an adoption of a Latin script spelling convention. Several attestations in the 
German sample contain something that is termed stummes h ‘silent h’. The term 
refers to the character <h> when it is not pronounced as a fricative. In some words 
the spelling reflects a historical fricative pronunciation but in most cases silent 
h is a purely orthographical element, marking either vowel length or the sylla¬ 
ble boundary. However, the presence of such spellings in word-forms like 
<gahat> gehet ‘goes’ in the manuscript is not necessarily an imitation of Latin 
script writing conventions. The realisation of the silent h as a fricative is a typical 
spelling pronunciation in German and as such often encountered in the speech 
of religious services and in many church songs. The distribution of the silent h 
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spellings in the manuscript suggests that, in fact, they might well be renderings 
of the spelling pronunciation as they occur in religious texts only. 


3.3.4 Gemination and double characters 

As in Arabic, but not in Ottoman Turkish, gemination is phonemic in Hungarian 
and Latin. In both languages it is indicated by doubling the consonant graph¬ 
eme in Latin script writing. In Hungarian complex consonant graphemes, such 
as <sz> /s/ or <cs> /tf/, only the first component is written twice, hence /s:/ is 
represented by <ssz>. In Latin-script German, the doubling of consonant graph¬ 
emes also exists, but, in contrast to Latin and Hungarian, it does not encode the 
quantity of consonants, which is not phonemic in German, but indicates that the 
vowel preceding the doubled consonant grapheme is short. In all three language 
samples, the Arabic diacritic sadda' < 2 > indicating gemination in Arabic is used, 
but all of them also show doubling of the consonant grapheme, sometimes even 
a combination of both is found, i.e., superscription of one of the two identical 
consonant graphemes with sadda. 

Unsurprisingly, the last mentioned spelling is most frequently encountered 
with /l/ and /l:/, as for these consonants a prominent model, *dl' allah, existed. 
Writing the consonant grapheme twice occurs in Latin with /r:/, in Hungarian 
with /b:/, /l:/, /t:/ and /z:/, and in German, the following characters are doubled: 
11 <H>, li <tt> and - <ss>. At morpheme boundaries, where the final consonant 
of the preceding morpheme and the initial consonant of the following mor¬ 
pheme are the same, the doubling of consonant graphemes seems natural and 
can be encountered in Ottoman Turkish as well. One might argue that the double 
spellings in Hungarian and Latin are due to a misconception of the morpheme 
boundaries by the scribe (although nothing in the Hungarian and Latin samples 
supports such an assumption), but in the German sample, where neither sadda 
nor the double graphemes can be phonetically motivated, the influence of Latin 
script writing habits seems to be the only explanation. 


4 Conclusion 

The inclusion of texts in four languages, not usually written in Arabic charac¬ 
ters, into an Ottoman multiple-text manuscript, was an ambitious endeavour. 
The scribe had to adapt a single writing system for languages which differ sub¬ 
stantially from each other and from those whose Arabic script writing habits 
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and conventions he knew. As shown, his adaption remained rather conserva¬ 
tive, he did not devise any new characters nor did he use those which were part 
of the Ottoman Turkish inventory independently of their use in that language. 
Nevertheless, he developed original solutions for the graphic representation of 
phonemes which are not part of the phoneme inventories of Ottoman Turkish, 
Persian or Arabic; these solutions can be seen in his use of kaf ^ <k> and hamza 
c <?>. His system was able to represent most of the consonant phonemes of 
the European languages, but the representation of vowels, especially rounded 
vowels, was very ambiguous and can be interpreted only by readers well famil¬ 
iar with the respective languages. The vowel phoneme inventory of Ottoman 
Turkish is also underspecified by vowel graphemes of Arabic script, but some 
of the shortcomings could be resolved by the Ottoman system of indicating the 
backness of the vowel by the graphemes for the uvular or pharingealised conso¬ 
nants in the environment of the vowel. In the European samples there is no sys¬ 
tematic use of the Ottoman Turkish writing conventions, although some traces 
can be found. 

This paper claims that the graphic representation of the European samples 
renders spoken rather than written language; nevertheless, these representa¬ 
tions are not based only on phonetics. A few instances of morphological spell¬ 
ings which were also features of the respective Latin script orthographies can 
be observed. Some peculiarities of spelling could be influenced by writing con¬ 
ventions in other writing systems, but the only phenomenon where Latin script 
orthography is the most probable explanation is the use of sadda '< 2 > and double 
graphemes in Hungarian, Latin and German. The fact that the scribe must have 
been familiar with German Latin script writing, at least to some extent, can be 
clearly seen in the German sample where sadda '< 2 > and double graphemes are 
not used to represent gemination. 
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Appendices 

Appendix I: List of abbreviations 


Languages 


Class. Lat. 

Classical Latin 

Cro. 

Croatian 

CSS 

Central South Slavonic 

Ger. 

German 

Hun. 

Hungarian 

Lat. 

Latin 

OCS 

Oid Church Slavonic 

Stand. Trk. 

Standard Turkish 

Other abbreviations 

1 . 

first person 

2. 

second person 

3. 

third person 

abl. 

ablative 

acc. 

accusative 

comp. 

complementizer 

dat. 

dative 

det. 

determiner 

gen. 

genitive 

imp. 

imperative 

indf. 

indefinite 

inf. 

infinitive 

neg. 

negation 

ONB 

Osterreichische Nationalbibliothek 

pfv. 

perfective 

pi. 

plural 

poss. 

possessive 

prs. 

present 

r 

recto 

sg. 

singular 

V 

verso 
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Appendix II: 

Table 4: Most frequent graphic representations of vowel phonemes 


Vowel 

Position 

Graphic representation 

Language 


Initial 

i <’a> 

Cro., Flun., Ger. 


1 <’a> 

Hun.? 12 

a 

Mid 

1 <’a> 

Cro., Hun.?, Ger., Lat. 



1 <’a> 

Cro., Ger., Lat. 


Final 

1 <’a> 

Cro., Hun.?, Ger., Lat. 



1 <’a> 

Cro., Hun.?, Lat. 

a: 

Initial 

i < a > 

Hun., Lat. 



i <’a> 

Lat. 


Mid 

1 <’a> 

Cro., Hun., Ger., Lat. 



i<’ a > 

Cro., Hun., Ger., Lat. 


Final 

1 <’a> 

Hun., Ger. 



1 <’ a > 

Hun., Ger. 

D 

Initial 

i <’ a > 

Hun.? 



1 <’a> 

Hun.? 


Mid 

1 <’a> 

Hun.? 



j <wu> 

Ger. 


Final 

1 <’a> 

Hun.? 



i <’ a > 

Hun.? 

D: 

Initial 

- 



Mid 

j <wu> 

Ger. 


Final 

- 


e, £, a 

Initial 

1 <’a> 

Hun., Ger., Lat. 


Mid 

<a> 

Cro., Hun., Ger., Lat. 



»<ah> 

Cro., Hun., Ger., Lat. 


Final 

»<ah> 

Cro., Hun., Ger., Lat. 



A ' 

Q). 

V 

Ger. 



<a> 

Ger. 


12 In Standard Hungarian and most Hungarian dialects, a short phoneme /d/ contrasts with 
long /a:/. In the German sample, which is mainly Central Bavarian, words containing the vowels 
/d/ and /d:/ in Central Bavarian but /a/ and /a:/ in Standard German and other dialects show j 
<wu> spellings, suggesting that the sounds in question are /d/ and /d:/, not /a/ and /a:/. In the 
Hungarian sample, however, words with an assumed /o/ are not written in a way that suggests 
any other sound but /a/. A short /a/ phoneme is known in northern Hungarian dialects (Paloc 
dialects) but there the correspondent of /a:/ in Standard Hungarian and the other dialects is /d:/ 
(Kalman 1966, 40), of which no traces can be found in the Hungarian sample. 
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e:, £: 

Initial 

' <a> 

Flun., Ger., Lat. 


Mid 

<a> 

Cro., Flun., Ger., Lat. 



f <ah> 

Cro., Flun., Ger., Lat. 


Final 

6<ah> 

Hun., Lat. 

i, i 

Initial 

<_$' <’yi> 

Cro., Hun., Ger., Lat. 



Ls' <’y?i> 

Ger. 


Mid 

ls <yi> 

Cro., Hun., Ger., Lat. 



,<i> 

Cro., Hun., Ger., Lat. 


Final 

ls <yi> 

Cro., Hun., Lat. 



,<i> 

Cro. 

i: 

Initial 

ls ' <’yi> 

Hun., Ger., Lat. 



Ls' <’y?i> 

Ger. 


Mid 

ls <yi> 

Cro., Hun., Ger. 



<i> 

Cro., Ger. 


Final 

ls< y\> 

Ger. 

0 , D 

Initial 

j' <’wu> 

Cro., Hun., Ger., Lat. 


Mid 

J<WU> 

Cro., Hun., Ger., Lat. 



<u> 

Cro. 


Final 

J<WU> 

Cro. 

o: 

Initial 

j' <’wu> 

Hun. 


Mid 

J<WU> 

Cro., Hun., Ger., Lat. 



<u> 

Cro. 


Final 

j<WU> 

Hun., Ger., Lat. 

0 

Initial 

j' <’w?u> 

Hun. 



j' <’wu> 

Hun. 


Mid 

J <w?u> 

Hun. 



J<WU> 

Hun. 


Final 

- 


0: 

Initial 

j' <’w?u> 

Hun. 


Mid 

J <w?u> 

Hun. 



j<WU> 

Hun., Ger. 



<u> 

Ger. 


Final 

J <w?u> 

Hun. 

U, u 

Initial 

j' <’w?u> 

Cro., Hun., Ger., Lat. 



j' <’wu> 

Cro., Hun., Ger. 


Mid 

J <w?u> 

Cro., Hun., Ger., Lat. 



J<WU> 

Cro., Hun. 


Final 

J <w?u> 

Cro., Lat. 



J<WU> 

Cro. 

u: 

Initial 

j' <’w?u> 

Hun. 


Mid 

J <w?u> 

Cro., Hun., Ger. 



J<WU> 

Cro., Hun., Ger. 


Final 

J <w?u> 

Hun., Ger. 



J<WU> 

Ger. 
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y.Y 

Initial 

jl <’w?u> 

Hun. 



jl <’wu> 

Hun. 


Mid 

j <w?u> 

Hun., Ger. 



j <wu> 

Hun. 


Final 

- 


y: 

Initial 

- 



Mid 

j <w?u> 

Hun., Ger. 


Final 

j <w?u> 

Hun. 

r 

Initial 

- 



Mid 

jj<yir> 

Cro. 



j<ar> 

Cro. 


Final 

- 


ay 

Initial 

<’ a y> 

Ger. 



lsI <’ay> 

Ger. 


Mid 

<’ay> 

Ger. 



lS <ay> 

Ger. 


Final 

ls' <’ay> 

Ger. 

au 

Initial 

ji <’ s w> 

Ger. 



jl <’aw> 

Ger. 


Mid 

Co t 
A 

s' 

V 

Ger. 



jl <’aw> 

Ger. 


Final 

jl <’aw> 

Ger. 

ve 

Initial 

- 



Mid 

LS<iya > 

Ger. 


Final 

■G.<iyah> 

Ger. 



<r.<iya't> 

Ger. 

J£ 

■ 




Mid. 

cs<:iya> 

Cro. 

DY 

Initial 

lS ji <’uwy> 

Ger. 


Mid 

csj <wuy> 

Ger. 


Final 

lSj <wuy> 

Ger. 

YB 

Initial 

- 



Mid 

j<wa> 

Ger. 


Final 

■ 






Florian Sobieroj 

Standardisation in Manuscripts written in 
Sino-Arabic Scripts and xiaojing 

Abstract: Standardisation processes concerning orthography, handwriting and 
page layout can be observed in manuscripts written in Sino-Arabic scripts that 
may or may not include transliterated Chinese-language texts (xiaojing). 1 Besides 
identifying some of these processes, it is the objective of the present paper to 
explore the xiaojing phenomenon with regard to name and script, earliest evi¬ 
dence as well as its function as a system of writing Chinese. The material used 
for this investigation are trilingual manuscripts written in Arabic, Persian and 
Chinese mostly produced in Northwest China in the context of Naqshbandiyya- 
based Sufism and higher education at the madrasas. Accordingly, the texts inscri¬ 
bed in the manuscripts relate mainly to Islamic mysticism and dogma, to prayer 
and philology. In the presentation of this material, different page-layout formats 
and configurations of languages will be looked at and the conventions that have 
been followed in writing xiaojing will also be taken into consideration. 


1 Introduction 

In this paper, an attempt has been made to identify some standardisation pro¬ 
cesses concerning orthography, handwriting and page layout in manuscripts 
written in Sino-Arabic scripts ( khatt-i sini), which include xiaojing, i.e. Chinese 
texts transliterated in the Arabic script. An effort has been made to grasp the xiao¬ 
jing phenomenon by looking at its name, the script used for it, the earliest evi¬ 
dence of its use and by studying its function as a system of transcribing Chinese 2 
while also pointing out its inherent deficiencies. The multilingual manuscripts in 
Arabic, Persian and Chinese used as material for this investigation were mostly 
produced in Northwest China in the cultural environments of the Naqshbandi 
Sufi and madrasa education. The material includes some of the most important 
Islamic texts which continue to be studied by Chinese Muslims, such as the mys- 


1 Sino-Arabic scripts in this paper are defined as styles of Arabic writing which show a Chinese 
influence, irrespective of the language, be it Arabic, Persian or Chinese, while xiaojing signifies 
(dialect forms of) the Chinese language written in the Arabic script. 

2 It will be shown that the Arabic writing system allows to record the spoken Chinese language 
in a way that the Chinese characters cannot. 

3 Open Access. © 2019 Florian Sobieroj, published by De Gruyter. [/“hyMsHBI This work is licensed 
under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110639063-008 
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tical Kitab al-Jahri, Manaqib and Ashi“at-i Lama’at, and additionally some gram¬ 
mars, catechisms, glossaries and prayer books. In this presentation of textual 
material, different page-layout formats and configurations of languages will be 
identified. 

Besides covering these points, the xiaojing conventions generally observed 
by scribes will be outlined and exemplified. These include the following, among 
others: the choice of a small, unpretentious script (mostly); full (or nearly full) 
vocalisation; graph features serving as adaptations to the Chinese sound system, 
such as doubling of vowel signs within the rasm (consonantal skeleton) of a 
word; the letters /cd/and sad with three dots above them; fatha added ‘superflu¬ 
ously’ to a consonant followed by alif; hybrid forms including Chinese characters 
(occasionally); jim written in the initial form even when it occurs in isolation; and 
the lack of distinction made between the consonantal sounds x- (as in Chinese 
xi) and s- (as in si), although the sounds q- (as in quart) and j- (as in jiang) are 
distinguished. The xiaojing writing system also accurately represents final nasal 
consonants in accordance with dialectal differences, such as -n/-q variation. In 
Mandarin variety spoken in Linxia (Lanyin ^f[l Mandarin, the dialect retaining 
some linguistic traits of Middle Chinese (MC) and which features prominently in 
the manuscripts discussed in this chapter) the final nasal is realised as [-n] and 
this sound is written as <n> in Arabic script. This differs from the xiaojing used for 
writing the dialects exhibiting the final velar nasal [-q], which is represented by 
the letter /cd/with three dots above. 3 

The Latin-script transliterations of the xiaojing transcriptions given in this 
chapter follow the pinyin system used in the People’s Republic of China to trans¬ 
literate Chinese, despite the fact that pinyin is a rather artificial system and far 
from ideal for recording the variety of the spoken languages. As the xiaojing texts 
are transliterations of Chinese dialects rather than of Mandarin (Putonghua), the 
pinyin transliterations can only be considered as approximations to the actual 
pronunciation. 


3 The authors of the Chinese Wikipedia article ‘/J'JLff’ (xiaoer jing) (Wikipedia ‘/WUS’ xiao- 
er jing 2019) specify that xiaojing not only transliterates the language spoken in the Northwest 
(Lanyin) but also that of the Central Plain and of the Northeast. According to the charts in the 
same Wikipedia article, velarised /r)/ is represented by final kaf& written with three dots above 
and by initial consonant with sukun above it + vowel, e.g. *31) (= yang ‘sheep’), ^ 1 j (= wang 
7s, ‘to forget’), “ j) (= yong ffl, ‘use’); however, these combinations of letters have not been seen 
in the manuscripts studied, nor are they included in the Xining xiaojing syllabary (see below). It 
will be demonstrated (e.g. section 4.4.1.), that even the structure of the Chinese language tran¬ 
scribed in the manuscripts shows dialectal influence. 
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1.1 The name ‘xiaojing’ 

The name of the Arabic transliterations of Chinese language is used in a small 
number of variations. Xiaojing /hH (lit. smaller canonical writings) is generally 
understood to refer to children because it was used in teaching young children 
Islamic texts in Chinese before they studied the Arabic language. 4 5 Corresponding 
to this understanding, the designation xiaoer jing /hJEM, meaning ‘children’s 
canonical writings’ has also been used, and the name xiaojing may therefore also 
be considered an abbreviation of this designation. This didactic use of xiaojing 
is apparent in a small number of Chinese catechisms written in the Arabic script 
said to have been composed specifically for Muslim children, for instance those 
authored by Ma Tianmin in the middle of the 20 th century, as discussed in 

section 4.3. However, Arabic transliterations of Chinese were not only employed 
in primary schools ( xiaoxue /Mjfc), which were financed by the Muslim commu¬ 
nities (jiaofang jkViT and therefore enjoyed independence from the state, but 
they were also used widely in jingtang jiaoyu M'vlSlhF, or madrasa education, in 
Northwest China. This institution emerged in the middle of the 16 th c. in Shaanxi 
province, from where it moved to southern Ningxia. 6 


4 Cf. Bakhtiyar (1994, vol. 4, 71). 

5 Cf. Stocker-Parnian 2002,154. 

6 More specifically, to the Tongxin MIT' area of Ningxia; cf. Zhou (2008, 47-48), who mentions 

the name of Hu Dengzhou (d. in 1597) from Shaanxi province, the founder of jingtang 

jiaoyu (also Yang 1996, 79). 
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Fig. 1: The provinces Qinghai, Gansu, Ningxia, and Shaanxi. 


Feng Zenglie (2007, 618) has proposed an alternative explanation of the 

name xiaojing /hi!, however. He explains the designation in relation to the com¬ 
plementary term dajing XM: the ‘larger jing’ are the Islamic scriptures in the Arabic 
and Persian languages, whereas the ‘smaller jing’ (xiao jing) are those written in 
Chinese language, but in Arabic script. Besides this, Feng mentions the variant 
designation xiaojing (with nearly identical pronunciation [different tone of the 
first word], but written in different characters), which he explains as denoting the 
‘digesting of [Islamic] canonical literature’ (xiaohua jingwen iOftMTQ. 7 


7 The authors of the article ‘/p jLH’ ( xiaoerjing ) (Wikipedia vh)LI£’ xiaoer jing 2019) claim, 
while referring to an internet publication as their source (footnote 2: @1^0^inlK jftXXHuizu de 
yuyan he wenzi , published 2005 on the site “tM Ningxia liiyou wang, a tourism page), 

that in the Northwest the name variant xiaoer jin /]' is generally used, abbreviated as xiaojin 
/hHH, while in the Central plain and the Northeast the preferred form is xiaoerjing dxKIS. 
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1.2 The alphabet and scripts 

The xiaojing writing system is based on the 28 letters of the Arabic alphabet, 
to which xiaojing adds some letter forms derived from Persian, such as pa, cim 
and zay, and a few relatively unfamiliar graphs such as sad with three dots 
above it (as used for the initial c-, as in can (Mandarin cang) M, ‘storage’)- 8 
The three dots above the Arabic letter leaf [to represent the phoneme j-, as in jiu 
M, ‘wine’ in the standard language [not identical with the one transliterated 
in the xiaojing texts]) 9 are an adaptation to the Chinese sound system, whilst 
the doublings of damma (nunation), fatha and kasra, which were added to the 
consonantal letters in the middle or final position, is a characteristic feature 
of xiaojing. 10 These orthographic features, with variations, are standard usage 
throughout the xiaojing manuscripts consulted. Although they are not written 
in one particular calligraphic style, the letters are mostly written in a very small 
script. Arabic texts such as the Qur’an, Takhmis al-Burda (Mukhammas ) and 
Mada’ih, which are often accompanied by xiaojing glosses in the manuscripts, 
are written in the bold ‘hieratical’ 11 khatt-i sini style in many cases, which is 
understood to show the influence of Chinese calligraphy. The presence of a 
sacred text seems to function as a trigger, fostering a standard in handwriting 
which may or may not include xiaojing. 


8 The letter sad with three dots is also used in the writings of the Tatars of Lithuania and Poland 
(as pointed out by Alessandro Gori); for this letter in Belorussian, Lithuanian and Polish Tatar 
manuscripts see Miskinene 2015, 66); a relevant image can be found in Sobieroj 2010, plate 3 
(with three dots beneath [!] sad). 

9 Kaf with three dots above was used for /r \/ in Ottoman Turkish, for /g/ in North African Arabic 
and Berber, for /g, t], ng/ in Wolof, and in other sub-Saharan writing traditions as well (as 
indicated by the editors of this volume). 

10 A useful set of charts of xiaojing letters and compounds accompanied by their Chinese 

character homologues in Hanzi ts ^ >=>• j* j ‘f j?) is included in the Xining 

edition of ‘Aqidat al-islam / Xinyang wenda (no date), pp. 77-79. A list of the phonetic values of 
36 consonants in the initial position and 73 syllables ending with a vowel can be found in the 
Wikipedia article 7JQLI5’ (Wikipedia 7JQLH’ xiaoerjing 2019). 

11 I have designated it this way because the classical SInl style was mainly employed to 
represent writings considered sacred by Northwest Chinese Muslims. 
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1.3 The significance of xiaojing as a system of transliteration 

In his volume dedicated to the subject of Islamic education among the Hui, desi¬ 
gnated as a ‘national minority’ in China, 12 Zhou Chuanbin JitHlf® highlights 
the unique role played by xiaojing as a system of phonetic transliteration of the 
Chinese language in use in the pre-modern era and as a direct (chronological) pre¬ 
cursor of the Latin script. 13 However, as Zhou points out (2008, 60-61), outside 
China, in the former Soviet Union, the Cyrillic script has been used to transli¬ 
terate Chinese in a version developed on the basis of xiaojing. 111 The Chinese in 
question are the so-called Donggan, Muslims from the provinces of Shaanxi and 
Gansu who were deported beyond the borders of the Qing Empire to Central Asia 
- Yang (1996, 71-72) adds that the destination was Kyrgyzstan, Uzbekistan and 
Kazakhstan. The deportees, referred to in the Russian Czarist empire as ‘Dungan’ 
but belonging to the same stock as the Hui in China, initially used xiaojing to 
transcribe their native language, but in the first half of the 20 th century they were 
made to adopt the Cyrillic alphabet after having gone through a phase of using 
the Latin script. 15 


1.4 The earliest evidence of xiaojing 

Zhou (2008, 57) mentions a stone monument erected in Xixiang W# Mosque at 
Xi’an University as the earliest evidence of the usage of xiaojing. This monument 
bears an Arabic text on the construction of the building, including Chinese per- 


12 Zhou (2008,59) includes some pictures illustrating the usage of xiaojing , including a modern 
Arabic transcription of Zhonghua renmin gonghe guo, ‘The People’s Republic of China’; Zhou 
(2008, 79) offers a picture of xiaojing marginalia in a manuscript of the Arabic grammar Zaowu 
misubaha )lE3i7|tj(f BBn [ Daw’ al-misbah], 

13 Already in the late 16 th century the fesuits had developed a system of transliteration that has 
left traces in modern pinyin (for the wider context see e.g. Mungello 1989). An in-depth study 
of xiaojing transliterations carried out with the help of linguists while especially taking into 
account the unusual graph features may help to reconstruct the Chinese language spoken at 
the time under scrutiny. A comparative study of the xiaojing writing system and the Dunggan 
material from the former Soviet Union is also a desideratum. 

14 It should be added that the Dongxiang ethnic group, who live in the Linxia region of Gansu 

province, also used a variant of xiaojing (cf. Yibulaxin J) hfijjft 2007, 138) called Dongxiang 
wen ‘writing of the Dongxiang’ or Huihui wen PUIfilTt, see (Wikipedia VjUlil’ xiaoer 

jing 2019). 

15 For the context see e.g. Dyer 1967. 
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sonal names and biographical notes written in the Arabic script in 1339-40. 16 
However, xiaojing was only used more widely after the emergence of the madrasa 
system at the end of the Ming era (1368-1644). Feng (2007, 618b) mentions two 
manuscripts produced after the rise of the madrasas as the oldest evidence of the 
usage of xiaojing in Islamic writings which are designated as jingji MUf) by Feng; 
one of these is an 18 ,h -century copy of the Persian Sufi manual Mirsad al-‘ibad 
written by the Kubrawl mystic Najm al-Din RazI (d. 654/1256; Okuyan 1988ff., vol. 
32,496-497); this includes Chinese glosses written in Arabic. 17 The manuscript was 
taken to France in 1909 and given to the Bibliotheque nationale in Paris as part of 
the exploits of the famous expedition to Gansu led by Henri d’Ollone (d. 1945). It 
was then described (albeit rather briefly) by Emile Blochet (1909,290 [no. 8]). 18 


1.5 The shortcomings of xiaojing 

One major shortcoming of this transliteration system was pointed out by Zhou 
(2008, 60), who says that a unified system of writing xiaojing has never existed; 
instead of adopting one standard, every Chinese dialect written in the Arabic 
script has been transcribed according to its own system, which means that certain 
conventions existed at the dialect level. 19 However, generally speaking, a lack of 


16 A dating most likely referring to the manuscript rather than to the text. Zhou also fails to 
indicate whether or not the transcription of these names shows any characteristics specific to 
xiaojing. Bakhtiyar (1994, 4, 77; with an illustration) mentions the earliest evidence of xiaojing 
which he saw in a manuscript, namely a copy of Rashid al-Din Fadl Allah’s (d. 1318) work on 
medicine, Tibb-i ahl-i Khita, apparently dated 1313 CE, i.e. during the author’s lifetime (!). 

17 In view of its importance as a foundational text for the Kubrawiyya Sufi order, the Mirsad was 

translated into Chinese: ‘Wu Zunqi 'tEliSg began translating it in 1672 and completed it six years 
later. He entitled it l§ jfclDllflti Guizhen yaodao shiyi ‘Explanation of the Main Path of Returning 
to God’, or Guizhen yaodao ‘The Main Path of Returning to God’, etc.’ (Ma 1999, 34). 

18 Feng (2007, 618b) confuses ‘F. (sic!) Blochet’ with Henri d’Ollone by making an explorer out 
of the cataloguer while also failing to mention d’Ollone’s name. The xiaojing marginalia in this 
manuscript are also mentioned in Bausani (1968, 875). 

19 Xiaojing transcriptions seem to have mainly been used in manuscript production in the 

Northwest (but see fn. 3). This impression is based on the perusal of some Arabic manuscripts 
that originated in the southerly province of Yunnan in which there are no traces of xiaojing. One 
example is a facsimile manuscript of the Muttasiq al-nahw, which is an introduction to Arabic 
grammar written by Ma Fuchu US® (Na 2007, 330-331), i.e. Ma Dexin SffISf Yusuf (d. 1874), 
printed in 1375/1955-56 and sold as a reprint from 1405 [= 1984-5] in Kunming (Chinese title on 
the front cover: Jianming ayuxue The 102-page Arabic manuscript with a 12-page 

glossary is accompanied by extensive glosses mainly written in the margins, but not a single 
one is in xiaojing. The first page of the manuscript includes biographical notes on the author in 
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standardisation in its orthography is apparent. Feng (2007, 618b) and Yang (1996, 
82-84) both highlight a lack of standardisation as well, which they trace back 
to the influence of dialects and the tendency of the authors of xiaojing (para-) 
texts to act independently of one another. Another reason is the different sound 
systems of Chinese and Arabic: the intonation and tone patterns of Chinese, i.e. 
the multiple tones (in the modern standard language four, in certain dialects 
more, in others less), are not represented graphically in the Arabic transcriptions, 
which often leads to confusion and mistakes. Besides that, no distinction was 
made in transcribing words consisting of one or two syllables respectively, such 
as jiu ‘wine’ and ji you ‘chicken oil’ (which probably sounded more or less the 
same; the examples are taken from Yang 1996, 84). 


2 Xiaojing in Kitab al-Jahri 

Ma Xuezhi Muhammad Mansur, one of the authors of the sacred biography 

of the Naqshbandi (Jahri) reformer of Guanchuan l®jl| in Gansu, Ma Mingxin J§ 
n/l'G Wiqayatullah (d. 1781; Ma 2007, 345; Forbes 1960-2004, 5, 851a), says in the 
preface of his Arabic-language Kitab al-Jahri (Mansur 1933,15) 20 that he is going to 
use xiaojing at certain points in the text, for which he gives the following reason: 
‘Most of the words of our shaykhs are in the Chinese language. Expressing them 
in Arabic is difficult, so some sayings are in Chinese, as it has been feared that the 
[meaning of the] original saying may be altered by the Arabic expression’. 21 

The xiaojing transcriptions scattered in the facsimile Arabic manuscript of 
the Kitab al-Jahri (429 pages of 15 lines written by at least two hands in khatt-i 
sini scripts) may be divided into personal names, toponyms, poetical verses, dia¬ 
logues, proverbs, glosses and compounds. 


Arabic, while the glossary has translations of well over a thousand words ( mufradat ) occurring 
in the text, which are written in Chinese characters. 

20 The K. al-Jahri on the history of eight generations of masters of the Jahriyya branch of the 
Naqshbandiyya order was authored by Ma Xuezhi S’PIl' Mansur (d. 1923) and was published in 
1933 as a facsimile manuscript written in a khatt-i sini script by Ibrahim, Mansur’s servant (a partial 
Chinese translation was published in 1997). The text starts with the vita of the ‘Pathfounder’, Ma 
Mingxin, and ends with that of Ma Zhenwu SMS;, who died in 1960 (cf. Sobieroj 2016a, 140). 

21 The passage is translated in the Daotong shizhuan (Mansur 1997, Introduction, 10): ‘[...] in 
order to maintain the original meaning of the Mawla’s [i.e. Mingxin’s] words (baochi yuanhua), 
I have used a xiaoerjin /J\!n!JS (!) transcription [in] many places and hope that the reader will 
understand [...]’. 
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2.1 Personal names 

Laojun which literally means ‘old man’, designates the deified Laozi. It occurs 

in a question uttered reproachfully by Ma Mingxin (‘Why, then, are there people [in 
my ritual circle] who still believe that the “Old Man” is God’ [man Vtaqada al-Lao- 
jun ilahan ]) and was directed at an erstwhile Taoist monk who had converted to 
the Islam of the Jahriyya. The name laojun written with the Arabic definite article 
has three dots above the letter kaf and nunation above the final ya’\ jU'. 

The letter kaf representing ‘Jie’ in the name of the famous 18 th -century mystic 
Liu Jielian MfYM 22 (Mansur 1933, 40) is likewise written with three dots above 
the horizontal line. 

The names of the (eras of the) Qing emperors Qianlong ^|S§ and Kangxi 0 
J?S, which are mentioned in Kitab al-Jahn (Mansur 1933, 69) to specify the year of 
the ‘martyrdom’ ( shahadatuhu ), i.e. death, and birth of Mingxin respectively, are 
also transcribed in the Arabic script, viz. as juSSll jimi a nd Only 

the first name, al-Sultan al-Kianlun (= al-Qianlong), is provided with vowel signs; 
the second one was left void of any vocalisation. The lack of vocalisation in the 
writing of al-Kansi is indicative of the absence of standardisation - unless it has 
been omitted because the matres lectionis make the rasm unambiguous. 23 To sum 
up, then, there seems to have been a tendency to standardise the transcription of 
names by prefixing the Arabic article al- to Chinese names. 


2.2 Toponyms 

In accordance with xiaojing conventions, topographical names are usually provi¬ 
ded with full vocalisation in the Arabic text (e.g. in Mansur 1933,32, al-Lianhuachen 
Ot' chen is used instead of cheng ‘town’ for standard /rj/). However, 
in the second occurrence of the place name four lines below that, the vocalisa¬ 
tion is dispensed with, perhaps because of having been considered redundant. On 
p. 32, the names of the town of Fuqiang if ft and the mountain called Liujiapo 
HWM. are also stated in the Arabic script, the latter name with full vocalisation. 

In the toponym Qin’an xian written as (Mansur 1933, 37), the 

non-phonemic initial consonant sound of an 'if is irregularly rendered by ghayn 


22 I.e. the mystical philosopher Liu Zhi Sim? of Nanjing (d. 1745; Luo 2007, 321-322), the ‘Ibn 
‘Arab! of China’. 

23 In contrast, the name of the chieftain of the village of Didianzi, Ma Laoye is only 

provided in the Arabic translation: al-Amir al-Farasi (Mansur 1933, 76, penultimate line). 
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as opposed to ‘ ayn . (This is the so called ‘empty initial’, or ‘zero onset’, which 
have a range of variation such as [?], [y]. Ip] and [h], Duanmu 2007). 

The name of the river called Miaoer he HfJEM is transcribed with 

full vocalisation, including the word he for ‘river’, with damma above the letter 
kha‘ and fatha above the letter waw. The name of the Gaoshan ifijj 111 or ‘High Hill’ 
mountain mentioned three lines above, in contrast, is given in the Arabic transla¬ 
tion, viz. al-Jabal al-‘ali (Mansur 1933, 45). 

The toponym Guanmenkou ISP 1 ! P (Mansur 1933, 46), to which the definite 
article is added, has been transcribed with vocalisation. The vocalisation of men 
F! ‘gate’ with damma and kou P ‘mouth’/‘entrance’ with kasra beneath the 
letter kaf _£ is also noteworthy. 

In the place name Puer zhuang WW3E (Mansur 1933, 46), consistently, the 
final [-resound of zhuang has not been reproduced in the Arabic script either. 
Conspicuously, the syllable er has merely been transcribed by the letter ‘ayn with 
the fatha vowel sign. 

Only rarely has a place name been left without any vocalisation at all. In one 
such instance, the unvocalised xiaojing is explained in an Arabic translation: on 
p. 76, the location called Didianzi which was populated by people who 

honoured Mingxin, but hated his successor, the Shaykh of Pingliang, is tran¬ 
scribed as 344-, but the name is preceded by an Arabic translation and note, viz. 
rib at al-qa‘r ; wa-huwa bi-lisan al-sin Didianzi ‘hospice of the depression, which is 
called D. in Chinese’. It appears that toponyms are regularly provided with vocal¬ 
isations in the manuscript, while less often a place name is given in an Arabic 
translation. 


2.3 Chinese poetry 

The lines of a duilian Tj'# or ‘antithetical couplet of parallel sentences’ 24 which 
Ma Mingxin recited extemporaneously in reply to a request by some dignitaries 
at the governor’s court in Xi’an, are transcribed in the Arabic script. As for the 
features of the script, the khatt-i sini used for transcribing the duilian has been 


24 The couplet quoted by Mingxin is designated as poetry both in Kitab al-Jahri and the Chinese 
translation: The daozu jfiii, ‘Pathfounder’, was asked to compose poetry (inshad al-shVr, zuo 
shou shi (TTt iFf) by his detractors and he accepted the challenge: ‘You want me to compose vers¬ 
es, while you cannot compose the first line (shang lian J;#). Well, in that case I will compose the 
first and you do the second line’ (dui xia lian ; Mansur 1997, 14-15; Mansur 1933, 28). The 

duilian includes rhyme words, viz. ‘Shaanxi’ Rl5 and ‘ma ti’ Mi© (‘horses’ hoofs’). 
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provided with full vowel signs, and in accordance with the traditional techniques 
of writing poetry in Arabic and Persian manuscripts, the endings of the verses 
have been marked by clusters of dots. The script of the xiaojing does not differ 
from that of its textual environment apart from its vocalisation. 

As an inclusion of the material in the manuscript of Zhanye’s WM Manaqib 
(e.g. p. 21) shows (for the text see below [p. 191]), it has been standard procedure 
to transcribe the poetical verses composed and recited by the masters of the Jahri- 
yya in xiaojing rather than translating them into Arabic. 25 The verses are made 
to stand out by using full vocalisation, and verse markers have been inserted 
between the hemistichs and at the end of the lines as well. 


2.4 Dialogues 

A second example of a xiaojing passage integrated within the main text of the 
Kitab al-Jahri can be found in the second chapter, which is dedicated to the life 
of Wiqayatullah’s successor, the ‘Pingliang taiye’ ( Hadrat Maulana 

al-A‘zam al-Pinlianfuwi; i.e. shaykh of Pingliang prefecture), also known as 
Mu Xianzhang WMM (cf. Ma 1999 [1985], 93). A somewhat obscure dialogue 
conducted between the Sufi Shaykh and the Taoist abbot ( sayyid al-ruhban ‘lord 
of the monks’) of the nearby monastery on Mount Kongtong 26 is given in the ori¬ 
ginal Chinese wording without a translation. 27 The dialogue included in the nar¬ 
rative seems to constitute an exercise in mufakhara, in which two disputants try 
to outwit one another. The four (unrhymed) lines of xiaojing stand out against 
their textual environment because of the vowel signs and the circles added as 


25 The recitations - again following the conventions of Arabic and Persian manuscript culture 
- are regularly introduced by the formula fa-anshada [shVran bi-lughat al-Sin], ‘he recited (a 
poem in Chinese)’ (e.g. Manaqib, p. 71; also pp. 211, 212, 216, 226 and 264). 

26 The greatest scenic attraction of the Pingliang area is the Taoist monastery on the summit of 

Kongtongshan (Mount Kongtong) which is also known as fiu gong ba tai shier yuan A,® 

A n'+^ISc, ‘Nine palaces, eight terraces, twelve courtyards’. 

27 ‘One day [...], the Pingliang Taiye went to the famous Mt Kongtong west of the city to gather 

some medicinal herbs. Once he had climbed up to the summit of the mountain, he encountered 
the head of the monks there, who was called Chen Banxian the “half-immortal Chen” [a 

bad pun or mockery, FS]. When the latter saw the Pingliang Taiye, he said: “You are wearing a 
dress of coarse material (« -5 y tubu yi ATfiTk) [...]”. The Taiye said: “I can move the pillar which 
borders on the sky ([...] 14 U j2; -£ wo shou ban tian bian zhu Aifi.fi) with my hand while 

my foot is on the ground” 0 A- 2 t yijiao tan dao di —PS5!liS). The monk could not think of 
anything to respond to that. The Taiye picked the herbs [he wanted], went back down the moun¬ 
tain and returned’ (Mansur 1997, 79; Mansur 1933, 91). 
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‘verse markers’. The traditional standards of Arabic poetry quotation have been 
followed by using these devices, although the four lines in themselves do not 
constitute poetry (Fig. 2). 
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Fig. 2: Kitab al-Jahri (a dialogue between the Taoist prior and the Shaykh). 


2.5 Proverbs 

Proverbs, too, have been left devoid of any Arabic translation in Kitab al-Jahri; 
they are provided in xiaojing, obviously resulting from the author’s apprehension 
that the meanings of the idiomatic expressions would not be adequately con¬ 
veyed by a translation. 

The two parts of the proverb, constituting a chengyu (fiScim) of four words, 28 in 
Mansur 1933,113 (Mansur 1997,101), which is written in Arabic script with vocali- 


28 The idiomatic phrase is uttered, in the third chapter on the life of the master Chuanchang 
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sation in the manuscript, are separated by a verse marker in the form of a circle. 
There is an overline at the beginning, the middle and the end of the proverb. The 
placing of three dots above the letter o- 3 in the word cang fS (the last word of the 
phrase) is a regular feature. However, cang is written with the homophone cha¬ 
racter ?fr rather than # in the Daotong shizhuan. The consonant x- in xiu flf ‘to 
build’ is represented by sin as in other cases too. 

The description above serves to show that proverbs and words exchanged in 
dialogues have been treated in the same way as poetry in terms of their graphic 
representation. 


2.6 Glosses (interlinear and marginal) 

Besides the xiaojing included in the main textual body of the Kitah al-Jahri, 
numerous glosses mostly written in Chinese characters have been added to the 
manuscript. These were written down around 2008 by a user who at that time 
was a student and a novice of Honglefu Convent and he owned the man¬ 

uscript. There are also a few cases of Chinese words in xiaojing transliteration 
between the lines and occasionally in the margins as well. 

An example of an interlinear gloss can be found in Mansur (1933,37,1.4 from 
below) where the phrase mata min dhalik majnunan, ‘he died as a result of this, as 
a madman’, has been explained by the expression tiba ifcW. ‘promoted by rank’, 29 
written underneath the line; the letter ta’ has been provided with kasra, and the 
letters ba’-alif have been written with fat ha. 

On p. 29, the uncommon word qaswara apparently denoting a lion is explained 
beneath the line by the Chinese term shizi ff T written in Arabic script: - pi. 

A lexical gloss has been added in the outer margin on p. 78: ^ A This entry 
transcribes Chinese guafu Jkftj as the Arabic word armala, i.e. ‘widow’. It seems 
that the scribe initially wrote cjla and only added the letter waw between qaf and 


Taiye (Mansur 1939,105-133), by the ‘Alim (saghlr) sultanl, Tittle kingly scholar’, called 

Xiaoshan Wangye (/hlL|7E:ft) in the Chinese translation (Mansur 1997, 101). The ‘Alim sultanl 
criticised the Sufi master whom he charged of insincerity, by uttering a proverb which is replete 
with historical connotations (qala bi-lisan al-sin, ‘he said in the Chinese language’; Mansur 1933, 
113): Mingxiuzhandao/anduchencang pU.ji-A * jili [building 

a covered way along a precipice openly - secretly taking the old path to Chencang] (Mansur 
1933,113; Mansur 1997, 101). Peiqi Yan of Munich University (LMU) has kindly guided me to the 
literary source of this proverb (cf. Cai and Sun IS 2008, 545). 

29 In Sufism, the majnun, turned mad through passionate love of God, is considered seized by 
divine attraction ( jadhba ), hence ‘promoted’. 
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a/i/aftcrwards or else he superscribed the letter waw so it would be interpreted as 
the vowel sign damma. 

The expressions ibtila’an wa-'khtibaran ‘trials and tests’ encountered on 
p. 66,1.6 are accompanied by two words in xiaojing transcription written beneath 
the base line, the first of which is illegible. The second word transcribed with full 
vocalisation is clear enough, however, and can be deciphered as VJ* or kaoyan % 
St ‘test, testing’. 

As we have seen, a few marginal and interlinear lexical glosses in xiaojing 
transcription have been added to the manuscript of Kitab al-Jahri and they reflect 
practices of annotation typical in the Arabic manuscript culture in their layout 
and functionality. 


2.7 Hybrid compounds 

There are also some examples of a mixture of the two, Chinese characters and 
Chinese in Arabic transcription. An example of this phenomenon may be found 
in Kitab al-Jahri, p. 54, line 5 from below, where the Arabic word for ‘nose’, khay- 
shumi, is explained underneath the line by the hybrid expression i.e. the 
letter ba’ with the vowel sign kasra followed by the character zi to render the 
Chinese word bizi M-zP for ‘nose’. The reason for this strange combination of 
Arabic and Chinese may be the difficulty the student had in finding an Arabic 
letter or compound to adequately represent the Chinese syllable zi (he may 
also have had trouble remembering the Chinese character for ‘nose’ or found 
it cumbersome to write the numerous strokes it consists of). The Chinese word 
for ‘parrot’ ( yingwu Hill ) 30 on p. 56 is given in an analogous format, i.e. the first 
word of the compound is written in Arabic letters ( hamza with two kasra’s). A 
further example is the expression maocao (the translation [p. 62] correctly has 
caomao ^tl), or ‘straw hat’, ( qalansuwwat al-hashish) written as a hybrid form 
(J^>) between the lines of p. 74. The expression serves as a means of describing 
the shape of a fragrant flower which grew on the roof of the mosque in Pingliang 
and symbolised the Pingliang Taiye as the head of the order. The hybrid marginal 
gloss on p. 78 consists of a combination of A (= lii PJ ‘green’) and the character ¥ 
(cui) to render the Arabic word zakharif ‘splendour, decoration’ (the meaning of 
the compound cuilii W-W is ‘emerald green’). 


30 The Kitab al-Jahri uses the Persian word tuti instead of Arabic babbagha’. 
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3 Manaqib-i Awliya ‘The virtues of saints’ 

Another literary version of the sacred history of the Jahriyya Menhuan is the 
Manaqib (M.-i Awliya), and it seems to show a higher proportion of xiaojing 
writing than does the Kitab al-Jahri. The Arabic text in question was composed by 
Zhanye lift ‘Abd al-Ahad in the twentieth year of the Republic of China (= 1931). 
The manuscript published as a facsimile of a khatt-i sini manuscript with a total of 
361 pages was copied by Ma Lugou Sadiqullah of Xiji in Ningxia province 

at the order of his shaykh. There is a colophon (p. 361) including the name of the 
scribe ( min yad ‘by [the] hand [of]’ etc.), Sadiqullah ibn Qamar, but neither a date 
is stated nor the place of publication of the facsimile. The Manaqib, which is com¬ 
posed of five chapters, was designed to serve as a supplement to the Rashahat 
‘percolations’ by ‘Abd al-Qadir. In contrast to the latter work, which only pro¬ 
vides biographical information on Ma Mingxin and the Pingliang Taiye, Zhanye 
added the lives of three more successors of the Pathfounder, namely Chuanchang 
Taiye Qutb ul-'alam (p. 82ff.), Siyueba Taiye A AH (p. 112ff.) and 

Shisan Taiye +HAH (p. 194ff.). 31 The Arabic work was translated into Chinese 
by Ma Siren A St', and the Chinese version entitled Mannageibu was published 
in Zhongwei, Ningxia province, in 2012. 

Not only are there numerous explanatory glosses between the lines (e.g. the 
unfamiliar Arabic toponym yamm al-timsah al-aswad, ‘Sea of the Black Croco¬ 
dile’, a literal, albeit slightly inaccurate translation of the name of the province of 
Heilongjiang, ‘Black Dragon River’, which has been transcribed above the line as 
j’A 3 -i. (p. 83). There are also some xiaojing glosses written in the margins (pp. 
194 and 291; a few other glosses are in Chinese characters, e.g. on p. 250). 

There is a relatively high number of Chinese poems written in the Arabic 
script in all five chapters (pp. 71, 206, 207, 211, 212, 216, 226, 241 and 264-265). 32 
Some lexical glosses in xiaojing between the lines of poetry quotations have 
been written obliquely above the words that are explained (e.g. on p. 28 where 
Ma Mingxin quotes verses of the Mukhammas while exhorting his disciples to be 
assiduous in their spiritual practices). 


31 Another somewhat shorter text on the history of the Jahriyya entitled Rashf has also been 
added on pages 362-395. This was written in year 8 of the Republic (= 1919), apparently by the 
same scribe. 

32 Zhanye’s text also includes some poems in Persian (on pp. 279, 346, 349 and 351) and Arabic 
(pp. 166-167). 
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Fig. 3: Oblique annotations, Zhan Ye’s Manaqib al-Jahriyya, p. 28. 


The layout of the poems is a little inconsistent. On one hand, some of the poems are 
written with spaces between the Chinese verses, which are treated like hemistichs of 
Arabic poetry, and they are separated by graphical elements as well (e.g. on pp. 71 
and 211). Other poetical quotations, on the other hand, are written continuously 
with a raised inverted comma serving as a verse marker (as on pp. 226 and 264-265). 

Besides this, there are a few mysterious Chinese prose words uttered by the 
shaykhs of the Jahriyya and transcribed in the Arabic script, which answer ques¬ 
tions such as ‘how will the Islamic religion be fulfilled?’ (literally ‘sealed’; p. 170f.). 
As an answer, some Chinese nouns are quoted in juxtaposition and the number of 
words is mentioned. On p. 171 (= Ma 2012,73), for instance, Shaykh Qutb ul- alam 
answers the above question as follows: ‘When [the meanings of] five Chinese 
characters are fully manifested ( tammat ), Islam will return to its origin, namely 
<-!<_>■ j y ( dao , wu [?], sin, lun, ban-, [!, i.e. fft]fiM)’; the transliterated 

Chinese words are identified, in a parallel transmission of the same text, through 
use of Chinese characters in the translation (Ma 2012, 50 = Manaqib, p. 119), but 
unfortunately an explanation of the relevance of these terms is not provided in 
the respective passages (the five nouns may be translated literally as ‘way’, ‘duty’, 
‘monk’, ‘dragon’ and ‘assistance’). 

Another relevant ‘transmission’ (the passages are introduced by ruwiya, ‘it 
has been transmitted’) is Manaqib on p. 95 (= Ma 2012, 41): ‘When nine Chinese 
characters are realised, Jahri Islam will spread’ ( idha tahaqqaqa tis c at ahruf 
al-siniyya yantashiru l-islam al-jahri). 
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4 Xiaojing in other Sino-Arabic manuscripts 

4.1 Ashi“at-i Lama'at 

Another khatt-i sini manuscript with content that is distinctively Sufi in nature 
also includes glosses in xiaojing and was published in facsimile (no place, no 
date; the Chinese preface is dated 1987). It is an Arabic translation of a Persian 
commentary on the famous Persian mystical tract entitled Kitab al-Lama ‘at ‘Book 
of [divine] flares’ by Fakhr ul-Din ‘Iraqi (d. 686/1287 or later). In Northwest China, 
the Lama'at has traditionally been studied together with a specific derivative 
text called the Ashi“at-i Lama'at ‘Gleams from the flares’ by Nur ul-Din Jam! (d. 
898/1492), 33 and notwithstanding the fact that the (bilingual) title on the book 
cover is ‘Kitab al-Lama'at bi-lisan ‘arabi ‘Book of [divine] flares in the Arabic lan¬ 
guage’ - Guangdian xueli the text is none other than that of Jaml’s com¬ 

mentary, albeit in an Arabic translation. The copying of the text was completed 
on 2 Dhu al-Hijja 1410 (26 June 1990), as indicated on p. 255. Besides the Arabic 
translation, the manuscript also contains the poetical lines of ‘Iraqi (or JamI) in 
the Persian text, which are overlined. Wide spaces have been left between the 
lines on a number of pages, which were meant to be filled with glosses of Chinese 
characters usually accompanied by xiaojing transcriptions. 34 In a few exceptional 
cases, as on pp. 65, 73, 102 and 106, characters have been added in the margins 
along with xiaojing. The broad margins have largely been filled with explanatory 
glosses in Arabic written obliquely against the frame of the text panel, mostly 
starting with qil, ‘it was said’, qawluhu, ‘his word’ or wa-l-murdd, ‘this means’. 

The poems (which are often quatrains) are quoted within the prose text in 
the original Persian language, followed by an Arabic translation. 35 They are also 
accompanied by Chinese translations written in Chinese characters and xiaojing. 
As for the format, two lines of characters and two lines of xiaojing are fitted in the 
wide space between two lines of the main text. For instance, on p. 48 (Fig. 4) line 


33 Jaml’s commentary was translated under the title of Zhaoyuan mijue WjJtfi&nk by Po Nachi 

SSlJftlS (She Qiling who died in 1710; cf. Ma 1999 [1985], 31; Luo 2007a, 500. We still 

need to find out whether or not the Chinese translations added to the Persian poems scattered 
throughout the text are those of the Zhaoyuan mijue. Zhou (2008, 78) includes a picture of the 
title page of the translation by She Qiling. 

34 E.g. on pp. 7, 8, 9, 43-45, 46 (no Chinese characters), 48, 49, 60-62, 69, 70,107-113 and 119. 

35 At the beginning of the text (p. 9), the Arabic translation is denoted as ta’rib in a gloss written 
above the line. 
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1 is a Persian verse 36 and line 2 is a Chinese translation of the last line on the pre¬ 
vious page (= the first verse of the Persian Ruba i) written in characters. This line 
is followed by line 3 which is a xiaojing transcription, followed by line 4 written 
in characters, providing translation for (Persian) line 1. Line 4 is followed by line 
5, which again is in xiaojing. The first line on p. 48 is this: 

N ->. Vs (jl j! jl Lai Aa-oj ^ ^Al 


‘I would like to call him by a hundred names. He is too sublime to fit into a name, though’. 

Lines 4 and 5, which are awkward Chinese translations of the above verses, run 
as follows: 

^ ^ I^jj ^ j \ ILj 

37 >knr^flfip* 


‘I shall praise the honoured one in a thousand names / there is nothing as high and myste¬ 
rious (as him) who is not to be found (by any human means)’. 

The Arabic translation is in line 7: 


^ujl V - 1 <_fill: ^ .X—I l . ~ 


Thus, the structure of such complex translational technic is as follows: Persian is 
translated into Chinese, Chinese is transliterated in xiaojing which is followed by 
translation into Arabic. 

However, not all the translations of Persian verses are consistently represen¬ 
ted in this way: on p. 46 a quatrain quoted is followed by two lines of xiaojing 
translation, but a character version is missing. 


36 This corresponds to p. 53 of the Lama 1 at edition by Hadi Rastegar Moqaddm Govhari (thus 
transcribed on the English title page), Qum: Bustan-e Ketab Publishers, 1390/2011. 

37 The Chinese characters written from right to left may be transliterated in pinyin as follows: 
yujiangqian hao chengzun que / wu naxuangaobu keqiu, corresponding to the tentative reading 
of the xiaojing line: wo jian qian hao chen zun que (?) /wuna xuan gao bu ke qiu. 
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Besides that, xiaojing glosses are included between the lines, explaining 
Arabic expressions, as on p. 46 where the central expression lama‘an in the 
phrase wa-bana lama‘anu kamalihi, ‘the glow of His perfection has appeared’, 
is explained as <jljS u'S xian jian guan = xian dian guang ffiHSTfc. As for the 
orthography, it is worth noting that the phoneme x- (as in xian II) has been tran¬ 
scribed as h and d- (in dian ) as 



Fig. 4: TheKitab £j/-tcjmcj'of(divinefiares), facsimile manuscript, no date, no publisher’s information. 


4.2 Xiaojing glosses between the lines and in the margins: a 
Persian grammar 

Chinese glosses written in the Arabic script have also been written in the margins 
and between the lines of the main text in a facsimile manuscript of the Persian 
grammar entitled Minhaj /K7I/l [ rr)P, which was published recently (there is no 
indication of the date and place, though) and sold in a Muslim bookshop in Ping- 
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liang in 2014. 38 The Persian work is one of the madrasa books used in Northwest 
China and listed by Ma (1999 [1985], 29): ‘The author of the Hawa-i minhaj is the 
Chinese Muslim scholar Chang Zhimei The book has been written in 

Persian and gives a summary of Persian grammar. It was designed to help stu¬ 
dents in mastering the Persian language’. 

The incipit of the facsimile manuscript written in a distinctive khatt-i sini 
style 39 goes <-^1 jj <5 JUj -dll Jju, i.e. ‘Know - may God 

inspire you - that Persian words are of three types’). There are just 12 lines to each 
page. The booklet contains a total of 66 pages altogether and includes a colophon 
which mentions the name of the scribe and date of completion. According to this, 
it was copied in Rajab 1070 (^dU* j J »(= March-April 1660) by Muhammad ibn 
al-Hakim al-Zinami (?) al-Shanduni al-Sini, ^4-^' ^5 1 ', i.e. a scribe orig¬ 
inating from Jinan in Shandong province. Beneath the place name, ‘al-Zaynami’, 
the explanatory word ‘Zinan-fu’ has been written in a minuscule script with a 
vocalisation which is standard for xiaojing, i.e. doubling of vowel signs within the 
rasm of a word - in this case, doubled fatha above nun, namely ^J (Zinan-fu = 
‘Jinan prefecture). 

The glosses in this manuscript are mostly in Persian, but sometimes in Arabic 
and occasionally in Chinese, which is written in the same style of the Arabic script 
as the glosses in the two other languages. As a rule, the explanations have been 
added above the word that is explained, as in the first sentence on p. 1 where it 
says that Persian words are of three types, namely nouns, verbs and adjectives 
(; ism, fid and harf). Two examples are given for each type. The category of the verb 
is exemplified by zad wa gasht, ‘he hit and turned away’, the first example being 
translated by Chinese G (= da f]’ ‘to hit’). As elsewhere (a mark of orthographic 
standardisation), fatha has been added to the letter dal, although it is followed 
by alif and is therefore unnecessary phonetically. A xiaojing gloss written beneath 
the line can be seen on p. 54: in the overlined phrase ^ U j ^bU jll jj jj ‘i can 

free you from your ignorance’, the noun nadani, ‘ignorance’, is explained as ^ m 
j-: bu zhi dao, ‘[I] don’t know’ (or: ‘not knowing’ which is equal to ‘ignorance’). 
It may be noted that in this transcription, the letter jim, although occurring in 
isolation, is written in the initial and not the final form, which is contrary to the 
conventions of the Arabic script, but standard in xiaojing orthography. 


38 The book cover only contains the four characters (Mi nai ha zhi), which transcribe the Arabic/ 
Persian title. 

39 The tapering ends of the letters waw and ra’ are conspicuous here. Besides this, no distinction 
has been made between the other consonants (ba\leaf etc.) and their equivalents in ‘Ajami. 
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Fig- 5: The Minhaj (a manual on Persian grammar), facsimile manuscript, no date, no publish¬ 
er’s information. 


4.3 Independent xiaojing texts: Ma Tianmin’s catechisms 

4.3.1 The catechism Tahawur al-kalam fjmasa’il al-salatwa-l-siyam 

There are also manuscripts where xiaojing does not appear in the form of glosses 
in the margins or elsewhere, but where it figures in its own right. This is the 
case with some texts written by the 20 th -century author Ma Tianmin UjJ< K Ibn 






198 - Florian Sobieroj 


Ma'sum ‘Abd al-Rahman, as in the Tahawur al-kalam fi masa’il al-salat wa-l-si- 
yam, a catechism on issues of ritual prayer and fasting. 40 In the facsimile man¬ 
uscript published in 1406 (= 1985-6) in Baozi Mosque in Linxia BH 

JC - the title on both covers of the booklet is only in Chinese and runs Yisilanjiao 
libai fengzhai wenda ffi Pu]^, ‘Questions and Answers on Islamic 
Prayer and Fasting’ - six lines of Arabic text are accompanied by ten lines of a 
corresponding Chinese text written in the Arabic script covering about two thirds 
of each page. Rather than constituting a literal translation of the Arabic text, the 
xiaojing 41 text is an independent version of Ma’s catechism, which is made up 
of questions and answers. Unlike the Arabic text, the xiaojing text is written in a 
slightly larger script with vocalisation and punctuation marks (commas, colons 
and full stops), which bears witness to the recent (20 th -century) date of the copy. 
The recurrent expressions wun j and da C, i.e. wen fnj, da 'fs ‘question/answer’, 
have been put in brackets. 

Starting at the opposite end of the booklet (pp. 2-22), there is a Chinese 
version of the text printed (in stark contrast to the manuscript character of the 
Arabic I xiaojing text!) in Chinese characters, which can be used as a key to under¬ 
standing the xiaojing text. The Chinese translation written in characters was 
made in 1993 by one Ma Xiqing J§#Jli in or near Baozi Mosque (p. 22). 


40 Another work - an Islamic catechism entitled Tahawur al-kalam fi ‘aqa’id al-islam, ‘On the 
articles of Islamic belief’, which was written in 1952 - was composed in the same format (and 
published in Linxia as a facsimile manuscript: Baozi Mosque, Shawwal 2004 [reprint]). I have 
included a short discussion of the text in Sobieroj 2016b, 61-62. The introduction to the catechism 
states on p. 3 that the author has also written various other works: (i) only in Arabic (five 
Chinese [!] titles are listed), (ii) in Arabic accompanied by a xiaojing translation (the catechism 
and two more Chinese titles) as well as (iii) some Chinese texts written exclusively in an Arabic 
transcription (five titles). The author’s Tuhfat al-tullabfima'rifat al-siyagh wa-TVrab on grammar 
seems to represent the type of works written in Arabic without xiaojing. This impression emerges 
from the study of the reprint of a 41-page booklet first published in 1376/1956 in Hanjiasi Mosque 
in Linxia (‘[...] printed again after completion of a Chinese trans.’ [p. 1]; Baozi Mosque 1410/1990). 
The grammar designated on the title page, Qawd’id al-nahw - alaboyu jianming yufa jingtang 
yu PBffvfQmffiEHm'ftMZi'tH, ‘Succinct Arabic Grammar ( madrasa language)’, was designed to 
be used by beginners after mastering morphology Cilm al-sarf) at the madrasa al-thanawiyya 
al-‘arabiyya (p. 1 or second title page) and it includes the printed Arabic text with Chinese 
translations printed in characters on the same line. This seems to have become the standard 
format of modern editions of Arabic grammar texts such as the Kafiya, published in 2005 in 
Zhoupo JilK Eastern Mosque in Tai'an City rff, Shandong, for the use of madrasa students 
(manlia jingjianben SSlHIlffizk). 

41 The term ‘xiaojing’ is used in the Chinese preface of the Tahawur al-kalam fi ‘aqa’id al-islam 
and is probably also referred to in the Arabic introduction, viz., jSjji jibJI U (a minuscule 
letter ta’^ is placed above kaf *4 of 0 $ j# - a very uncommon orthographical addition). 
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The section on ablution, headed abdast - xiaodin (= xiao jing /.My- ‘minor 
ablution’), includes the following xiaojing passage: M ji i ^ (j) 

£l]1 4 £ -3^ 1 _ 3 ^ tjCl LS^ ‘ L 5 ^ • LS^ ( b ) • An 

attempt to transcribe the Chinese passage above using a system of transcription 
that closely depends on Pinyin but also takes account of the added vowel signs 
yields the following: ( Wun) xiaodin di fard shi shima, duwa shao? (Da) si jian: 
si nian (!), si lian shou, dao lian zhou, mu tou di si fun zhi i, si lian jiou, dao lian 
huai gu ‘Q: What duties are involved in the minor ablution? How many are there? 
A: There are four duties: to wash your face, to wash both your hands up to the 
elbow, to wipe one of the four sides of your head [and] to wash both of your feet 
up to the ankles’ (Fig. 6). 



Fig. 6: The catechism of Ma Tianmin Tahawur al-kalam fi ‘aqa’id al-islam (On the articles of 
Islamic belief). 


In the above section, the author does not explicitly say how to pronounce the 
letter dad in the Arabic noun fard, ‘duty’. In the Hanzi translation, fard is ren¬ 
dered by three characters with the phonetic value folezuo which may 

be interpreted in that the noun is pronounced closer to the Persian pronuncia¬ 
tion than to the Arabic. Mandarin lian for ‘face’ is transcribed as nian. Mandarin 
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jing for ‘ablution’ is rendered as din No distinction is made in the transcription 
between the consonants x- and s- in xi, ‘wash’, and si, ‘four’, both of which are 
rendered by the letter sin - in standard Chinese, the two sounds are clearly dis¬ 
tinguished in terms of their pronunciation. The consonant j- in jian, ‘condition’, 
for instance, is transcribed by the letter ^ with three dots above it and one kasra 
beneath it. The letter x- in xiaodin (= xiaojing, ‘minor ablution’) is rendered as 
H with kasra, while j- in jian is rendered as kaf with three dots, which seems to be 
inconsistent; in jing in the expression xiaojing (= xiaodin), the initial consonant 
is written as d-. Given that the initial /j/ in MC is a merger of Middle Chinese *kj/ 
gj and *dz/ts, it is plausible that this conservative dialect spoken in Linxia and 
represented in the manuscript has preserved some features of Middle Chinese. 

The author of the credo must have spoken this dialect because he was active 
as a madrasa teacher in Linxia. According to the Chinese introduction of the 
booklet on the credo, the author was teaching children at Hanjiasi Mosque 

in Linxia where he was known as ‘Jingkou si shifu’ # P H Df # (‘the fourth 
master of a place called Jingkou’; l; 


4.3.2 Xiningversion of Ma Tianmin’s catechism (‘ AqTda ) 

Another version of Ma Tianmin’s catechism paralleling the Linxia version 42 was 
published at Dongguan SRI Mosque 43 in Xining the capital of Qinghai 
province, 44 which borders on Gansu. The Xining version is entitled ‘Aqidat al-is- 


42 Cf. n. 41. 

43 The largest mosque in Xining and religious centre for at least 100,000 Muslims living in the 
town, it was founded in the Hongwu era (1328-98) of the Ming dynasty. The mosque was destroyed 
in reaction to an armed uprising by Hui and Salar Muslims led by the Jahrt Ahong Su Sishisan jS 0 
T"H and others against the Qing in 1781. It was rebuilt later, however, and was closed temporarily 
during the ‘Cultural Revolution’, then re-opened in 1979 (cf. Wu 1995, 459-460). 

44 The undated booklet, which I acquired in Xining around 2010, has a photograph of the 

Dongguan qingzhen dasi xuanli ta or ‘Great Eastern Bar Mosque with its 

minaret’ on the cover, which may be taken as an indication of the place of publication. The same 
mosque (and picture) also appears as the place of publication of an undated booklet on prayer 
( al-Ad’iya al-lazima wa-awrad al-salah / Changyong duwa libai nianci which 

contains the printed Arabic text of the prayers, but - contrary to what one should expect in light 
of analogous works - without any xiaojing. A transliteration of the text is provided in Chinese 
characters printed beneath the Arabic words, but according to the phonetic value of the individual 
characters (= Chinese ‘Ajami!) and without offering any clues as to the meaning. Occasionally an 
explanation in Arabic is provided in Chinese characters, explaining how and when to apply a 
prayer text, as on p. 23: ‘a prayer to recite while washing the right foot’ (xi you jiao nian de duwa; 
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lam Yisilan xinyang wenda ‘Questions and Answers on Islamic 

Belief’, and features a text in xiaojing transcription with simplified Chinese char¬ 
acters written between the lines in the same direction as the Arabic script. In 
contrast to the manuscript nature of the risala, the Arabic is printed (!) in a bold 
script, whereas the Chinese characters have been added in smaller types (seem¬ 
ingly also printed), from which it follows that the reader’s eye is immediately 
attracted to the Arabic text rather than the Chinese. Unlike the Linxia publication 
of Ma Tianmin’s text, the Xining ‘Aqida neither includes the Arabic words of the 
catechism nor a Mandarin Chinese version printed as a separate text. Comparing 
the interlinear Chinese characters of the Xining version (XN) with the Chinese 
version in the Linxia publication (LX), it emerges that there are discrepancies in 
the words and even in the structure of the sentences. In the section on disbe¬ 
lief, kufr is defined as negating what is obligatory to believe in and to look down 
upon what is exalted by the law. Whereas the Chinese has the words fouren bixu 
xinyang de I, ‘to negate what one must believe in’, the interlinear 

text of XN (p. 55) and the xiaojing text of LX (p. 40) written on the lower part of 
the pages has yinmei guixin ta biding de nage or / ls 

<j5 1 o*, the main differences being in terms of syntax, and the lexical var¬ 

iants fouren/yinmei of IStt, as indicated by the vowel sign, may have to be 
pronounced with a diphthong analogous to gui if); this results in two texts that 
appear to be almost completely different. 

The above differences may be due to the effort made by the editors of the 
‘Aqida to give Chinese character equivalents to the vernacular of Xining in the 
one publication, whereas the translation in the Linxia edition is Mandarin and 
does not give any equivalents of the xiaojing transliteration of the spoken lan¬ 
guage (of the Linxia region). 

Then there are differences in the two publications relating to the orthography 
rather than to pronunciation. Here are some examples of the orthographical dif¬ 
ferences: ta shi In*!, i.e. ‘it \kufr] is’, written as ^ G in LX and <_£ in XN; qingshi 
zai she-ri-er libian zunzhong 13 ‘it is to belittle what is hon¬ 
oured in the law’, is written as V -h G3 £ ^ ^ in LX and 

in XN; nage 315(0, ‘that one’, is written in LX and j 0 in XN; zhenzhu JC35, ‘true 
lord’, i.e. God, is written j in LX and » j in XN. The differences in spelling are 
indicative of a lack of shared standards (or of a discrepancy between the standard 
language and the dialect) despite the mutual geographical proximity of Xining 
and Linxia. 


the prayer text in characters runs: anla hunmai sanbitigaidaimanye etc., i.e. Allahumma 

thabbit qadamayya ‘God, make my feet firm!’). 
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4.4 Xiaojing translations following in the same line 

4.4.1 K. Fiqh KaydanT and two related texts 

In a facsimile copy of the famous Arabic work on the Islamic ritual prayer compo¬ 
sed by Lutfallah al-Kaydani (lived c. 750/1349; GAL, vol. 2: 253) and hence entitled 
K. Fiqh Kaydani (pp. 43-108), the xiaojing translations are not added as glosses, 
but follow in the line immediately after the Arabic wording, which they translate 
into Chinese. The xiaojing is written in the same khatt-i sini style of the Arabic 
script as the words of the Arabic text, which is overlined as a rule, but it usually 
takes up 2-3 times the space of the Arabic expressions translated. The xiaojing is 
vocalised, and the individual sentences are mostly introduced by the expression 
ta yao [shuojde shi or ye shi, ‘he means to say (also)’. 45 

The text copied in 1316/1898 - the colophon on p. 108 says ‘the translation of 
this book was completed in RabT II 1316/Aug.-Sept. 1898’ - forms the second of 
three parts all written by the same hand and in the same format with Arabic and 
xiaojing following in the same line, and was published on 30 September 2008 (no 
place mentioned) as a booklet bearing a bilingual title on the cover page: Kaydan 
Kitah and /hMWMM (Xiaojing Kaidani). 

The third text in the collection (on pp. 109-132) is the well-known and 
much-copied tract - in the Ottoman lands - on the condition of the ritual prayer 
Shurut al-salah (cf. Sobieroj 2010, no. 191, part 3), while the first text (pp. 2-42) 
is a less well-known anonymous text on dogma and ethics whose title is given in 
the preface as Risala tata‘allaqu bi-bayan al-Vtiqad wa-l-akhldq wa-l-af‘al (‘ala 
l-tartib wa-l-ijmal ) (‘A tract relating to the exposition of the creed, ethics and 
works, arranged logically in a summarising way’). 

While a colophon has been added to the second text, the third one is followed 
by a list of contents written in Arabic and headed by the Chinese word mulu § 
for ‘table of contents’ and dated 30 September 2008 (again in characters). 


45 An attempt to transcribe the beginning of the xiaojing text yields the following: ta yao de shi 
quanzan yu zhongshi weidu tiaoyang shiba tianyangkhalq de nagazhu de 

etc.) [...] G ‘ J-i ^ 42 , j# lsj o 5 » 1 j' i (ji .ijjlj (-_)j All 

jjiU 4 jsUJI j j 5=. -Gj). Whereas in two accessible Mandarin translations of the Qur’anic expres¬ 
sion al-‘alamin (the worlds) occurring in Sura 1:2, for instance, yuzhou ^lla or quan shijie 
is used, the xiaojing employs the Arabic noun khalq plus prefixed shiba tian + i.e. ‘the one 
lord who nurtures’ the eighteen heavens - the latter classification being common in Buddhism. 
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4.4.2 Xiaojing Qur’an translation 

The format of xiaojing translations following immediately after a reference text in 
the same line as has been chosen in the above Chinese edition of the Fiqh Kaydani 
has also been adopted for a full translation of the Qur’an. 46 A combination of 
printed Arabic texts and handwritten xiaojing translations that is reminiscent 
of an incunabulum is inscribed in the spaces between the individual Qur’anic 
verses initially left empty. This work was published on gum‘a (Friday), 23 August 
2002 (no place mentioned). The Arabic text in the publication is a transcription 
of the readings of Hafs ‘an ‘Asim , 47 which corresponds to the riwaya of the official 
Cairo edition from 1924. The Qur’anic verses are overlined and separated from 
the translations by verse markers with inscribed Arabic numbers. The Chinese 
translations are written in a fully vocalised Arabic script which follows xiaojing 
conventions as described in the introductory section. The language of the transla¬ 
tion is a Chinese vernacular, which emerges from the pronunciation of individual 
words as well as from the syntax. Surat al-ikhlas, one of the shorter chapters at 
the end of the mushafj no. 113), will be looked at by way of example. The xiaojing 
transcription of the first and the last verses (v. 1,3-4) runs as follows: 

48 if...] J C ji <111.2! U ji>i j i 

The unfamiliar orthography of what is pronounced as shuo ‘say’ in Mandarin 
is indicative of dialect, 49 as is sin for sheng ‘beget’ [children] (or for xing '[4 
‘gender, sex’?). The negation transcribed as maw you in xiaojing in v. 3, finally, is 
a dialect variant of Mandarin meiyou SW. 50 


46 A copy of this work was offered for sale in 2014 in a small mosque bookshop in the centre of 
Wuzhong City. 

47 I.e. the readings of ‘Asim (d. 744) as transmitted by his student Hafs. 

48 An attempt to transpose the above in pinyin yields the following: ay zhi sheng a ni shuo ta shi 
Allah duyi de ‘Prophet, say: he is God, the one’ (...) ta mei you xing ren / ren mei you xing ta / wu 
yige ren shi / yin wei ta zhe bu shi yi ban de. 

49 On the dialects of Northwest China, see Yang 1996, 3. 

50 With the aim of highlighting the specificities of the Chinese madrasa language (jingtang 
yu MUfm) of Ningxia/Gansu, Ding Shiren Tif— (Ding 2008, 48-53) contrasts a jingtang 
yu translation of the first five verses of Surat al-Baqara which he has transcribed in Chinese 
characters with a modern Chinese Qur’an translation (by Ma Jian J§IJ) of the verses in question 
(reproduced in Zhou 2008, 56-57). 
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4.5 Arabic-Chinese glossaries 

Another type of Islamic manuscripts contains Arabic-Chinese glossaries into 
which xiaojing transcriptions have been incorporated. These were compiled to 
help readers understand a number of texts traditionally taught at the madrasas. 51 
The reference texts to which the glossaries refer that I was able to consult are (1) 
Sa‘d al-DIn al-Taftazani (d. 791/1389; GAL, vol. 2: 278-280), Mukhtasar al-ma’ani, 
on rhetorics, and (2) ‘Mulla Tsam al-DIn’ [al-Isfara’inl; Sobieroj 2010: no. 185] 
t Manlia zidian - which, in fact, is Jaml’s Shark - on the Kafiya of Ibn 

al-Hajib on grammar. I found manuscripts of two different glossaries relating to 
Mulla ‘Isam al-DIn. 


4.5.1 Lughat Mukhtasar al-ma ‘am 

The beginning of the glossary entitled Lughat Mukhtasar al-ma‘ani al-mutarjama 
bi-lisan al-sini f [’I'ISj (/-'?/!#!- (Boyani zidian, ‘Dictionary of the Bayan’), six lines of 
which are included in the copy, has been written on the second page in Chinese 
characters and xiaojing . The incipit runs: <j4^' 

‘we praise you who have enabled us [lit. have widened our chest(s)/breast(s)] 
to summarise the explication’. The Arabic words of the incipit are written with 
blank spaces separating them, which are filled with translations ( women zanni 
huokai xiongtang zhaijian mingyan ‘we praise you 

for opening up [our] chest/breast so that we may select words of wisdom’ etc.’). 
The xiaojing transcriptions with vocalisation, on the other hand, are written 
above the Arabic expressions, i.e. G is ^ <jG ^3 ^ ji. j £ j. The three parts, 
Arabic, characters and xiaojing, have all been written by one and the same hand. 
Nunation is added at the end of some words, as in wamun (= women, ‘we’) and in 
thiun (= xiong, ‘breast/chest’; the consonant x- is transcribed by G). 

The glossary itself starts on the second half of this page. It is divided into sec¬ 
tions which follow the structure of the reference text and are headed by captions 
such as muqaddima, ‘ilm al-ma'ani, and ahwal al-isnad al-khabari. The format 
is four columns in which the individual lemmas are overlined and followed 
by Chinese translations in characters and xiaojing (the texts written in the two 
scripts are exactly the same). The first lemma, al-Taftazani, is explained in Arabic 
as [derived from] a place name (ism maudV, without a transcription). The second 
is al-faqir, ‘the poor’, followed by a Chinese translation in characters (written 


51 Zhou (2008, 58) includes a picture of a manuscript folio of an Arabic -xiaojing glossary. 
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from right to left: qiuji) and xiaojing: ^ j£. Thirdly, al-ghani, ‘the rich’, 

JSK etc. The lemmas are not arranged alphabetically, but according to the 
order of their occurrence in the text. 
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Fig. 7: The Bayan zidian (Dictionary of the Bayan). 
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The glossary ends on p. 70 with the date 28 March 1983 and adds a list of the 
poetic verses taken from a different manuscript, which are included in the Bayan 
with translations in characters and xiaojing written above the lines. The 26 pages 
of verses were published ( tubi'a [lit.: printed]) in 1982 at a place whose xiaojing 
transcription may be construed as the (topographically meaningful) name Sanji- 
aqiao, i.e. ‘three-family bridge’. The last verse, only five words of which are repre¬ 
sented in Chinese in the manuscript (marked by an asterisk here), runs: baqita* 
baqa’a l-dahri* yd kahfa* ahlihi / wa-hadha du'a’un li-l-bariyyati* shamilun*, ‘May 
you remain as long as the world/time remains, cave of his people / This is a prayer 
which encompasses the whole creation’. The five words translated as well as tran¬ 
scribed are ^ ifi , ; o'jS, ISI/M S y , J* ^ and y (Fig. 7). 

Owing to the atomistic nature of the xiaojing glosses, which only explain indi¬ 
vidual words, the tendency to translate text into a Chinese dialect seems to have 
been checked here. This contrasts with the passage-wise dialectal xiaojing transla¬ 
tion, as in the texts authored by Ma Tianmin, which were discussed in section 4.3. 


4.5.2 Lughat Mulla 'Isam al-Dm 
4.5.2.1 Qaryat al-Sanjiaqiao version 

Like the format of the Lughat Mukhtasar al-ma‘dni, the version of the Lughat 
Mulla 'Isam al-Din, which was published in facsimile form at the same place, i.e. 
Qaryat al-Sanjiaqiao, starts with an incipit of the text, only three lines of which 
are provided. The interspaces between the Arabic words are filled with Chinese 
characters, whose xiaojing transcriptions are written above (most of) the Arabic 
expressions, i.e. Quiidl j <1! j -up Jc. aJj] ‘Praise to Him 

who deserves it and prayers for His prophet and his (i.e. the latter’s) family and 
companions who have trained themselves in his manners’, 

[...] -tmm m a ^ ^ l?j b U 

J 4 J Ij (jLlii Jj ^ <^1 jjj y* 4 j jlj fa y — a [...] y^. 

In contrast to the Lughat Mukhtasar, the lemmas of this glossary are not 
arranged in columns, but are written from right to left in horizontal lines. The 
Arabic expressions are followed by translations in characters, above which xiao¬ 
jing is written. The booklet consists of 26 unnumbered pages of 17 lines ending 
with a colophon, which states the date of copying (20.4.1402/10.6.1983) and place 
of publication. In the gap between the explicit and colophon, the compiler has 
inserted a request in Arabic asking the scholars ( al-‘ulama’; i.e. the competent 
readers) to correct any mistakes he may have made in writing and translating. 

The system of transcription seems coherent internally, although it partly 
deviates from other authors’ systems. The scribe conventionally differenti- 
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ates between the consonants q- (as in quan which he transcribes as A and 
between j- (as in jiangM, transcribed by A); the [-q] sound of Mandarin Chinese, 
consistently rendered by n (jiang=jian <Al); the consonant r- (as in ren A), which 
is transcribed by ra‘ with three dots above it (j - a graph which corresponds to 
Persian zay ); and the consonant x- (as in xue j? A), which goes back to a h-sound 
and is rendered by the letter tha’ (Fig. 8). 
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Fig. 8: The Lughat Mulla ‘Isam al-Dm (A glossary on Islamic faith). 
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4.5.2.2 Kaifeng version 

The second facsimile manuscript of the Lughat al-Mulla (Manlia zidian ) was pub¬ 
lished in 1356/1937-38, or the 26 th year of the Republic of China 'j$ 4- 1 K; 
18 (‘Chinese republic’ with the phoneme h / kh in hua [China] ‘irregularly’ written 
as -a <h>, and final velar nasal [-rj] [in zhong], ‘regularly’ written as <n>) in the city 
of Kaifeng ^ lifts I I in Henan province under the auspices of Shams al-Din >?■ 
Zhenjiang, and it was purchased in Pingliang in 2014. 

In this booklet, the lemmas are listed in two columns (mostly as two keywords 
with their translations, which have been fitted into one line in each column) and 
they are arranged in order of their occurrence in the glossary. The Arabic expres¬ 
sions written in a bold khatt-i sini style (fat feet, tapering upstrokes) 52 are followed 
by xiaojing without any characters (there are, exceptionally, a few characters 
written above the xiaojing words), but characters re-emerge in the Chinese-lan- 
guage colophon where the date of publication is stated as well as the name of the 
editor ( bianjizhe ), Wang Hongxiang from Zhenjiang Ih/f.T) 53 

The translation of the first lemma AW ‘praise to Him who deserves it’, 
starts with the same words as in the Sanjiaqiao manuscript (plus the copula shi 
t§), which are transcribed differently, however, namely as follows: jJU i j ' j IS 
(in the Sanjiaqiao manuscript, it reads y lsj J* ' j for Pinyin Mandarin: quan 
zan shi wei du ying). The difference could be attributed to the influence of the 
regional dialect of Henan where the text was published, although it is far from 
certain that there was any such influence. The text may have been transcribed in 
the Northwest and only been published in Kaifeng (as a reprint, zai ban | f J-J[fi). If 
that were the case, it would be evidence of an extreme inconsistency in writing 
xiaojing and is therefore highly unlikely. However, there is also a chronological 
aspect that needs to be considered: the Kaifeng publication is half a century older 
than the modern Sanjiaqiao manuscript, which is also apparent in the usage of 
Persian words explaining some lemmas that are not found in the modern manu¬ 
script (e.g. zaman, ‘time’, for yawm, p. 77; siyah, ‘black’, for as-sawad, sebid (sic!), 
‘white’, for al-bayad, p. 78; and l eshq, ‘passionate love’, for al-hawa, p. 70). 


52 For these characteristics see Sobieroj 2014,103-106. 

53 The last page opposite the explicit contains a pricelist for ‘[Islamic] canonical books’ 
(jingshu ISd) which were available for sale at the Wenhua street mosque in Kaifeng. Apart from 
the present text, they include a glossary entitled Lughat al-Wiqaya relating to al-Wiqaya , the 
textbook on Hanafi law. 
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4.6 Munajat 

Xiaojing is found in two places in a facsimile khatt-i sini manuscript which mainly 
includes an assembly of Qur’anic verses designed to be used as prayers for 
Muslims in Gansu : 54 firstly, on the title page inscribed beneath the title Munajat, 
i.e. ‘prayers of intimate discourse’, and secondly, in the colophon. The Chinese 
words written in the Arabic script inform the reader (albeit inaccurately) about the 
contents of the booklet, which spans 54 pages of five lines per page. The Chinese 
subtitle runs cuJ yj LL i.e. 33 Qur’anic verses/passages ( duan transcribes 

the character W). The numbering of the ay at is written in Persian ( seyyom ayat, 
caharom ayat, etc.) in a different script. This indicates that the Persian-language 
glosses in khatt-i sini manuscripts serve a similar explanatory or text-structuring 
function to xiaojing. As a matter of fact, the number of Qur’anic verses included 
in the Munajat far exceeds 33, hence the noun duan should be understood as 
denoting passages or even chapters - the first of which is Surat al-Fatiha - rather 
than verses. 

The khatt-i sini style in which the Qur’anic verses have been written has a cal¬ 
ligraphic quality and features ‘fat feet and slender ankles’, which are refinements 
usually found in copies of the sacred Islamic texts in China. The prayer begin¬ 
ning with Alldhumma 'ftah land abwab al-rahma, ‘God, open to us the gates of 
mercy!’, which follows Sura 114, i.e. the last of the Qur’an quotations, in contrast, 
is written in a different, unpretentious script which does not feature any of the 
above characteristics. 

The colophon on the last page includes the name of the scribe, Mahmud ibn 
al-Khabir ibn al-Shaykh al-Rasul al-Watouwi. On p. 55, the main text borders on 
two rectangular panels which are each filled with one line of xiaojing written ver¬ 
tically from top to bottom in the same style as the Persian text dividers, and they 
contain a note on the person who is to recite the 33rd section. 


5 Conclusion 

The influence of Arabic manuscript culture has fostered processes of standardisa¬ 
tion in khatt-i sini manuscripts. This can be seen in the adoption of familiar page- 


54 The prayer booklet was sold in 2012, more or less illegally, in one of the small shops on the 
main road in Guanghe Sid, a village and centre of Islamic culture in Gansu situated south of 
Lanzhou. Like many other publications, it lay hidden under some prayer scarves, carpets and 
other day-to-day objects used by Muslims. 
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layout formats by the Chinese Muslim scribes and artisans. Wide spaces have 
been left between the lines of the main text, which is also the case in Ottoman 
manuscripts where Turkish or Persian glosses have been added to explain certain 
Arabic expressions. And yet the division of the page into a section including an 
Arabic reference text and a (larger) section reserved for translation into a non- 
Arabic language, as is the case in Ma Tianmin’s xiaojing catechisms, is a format 
hardly ever seen in manuscripts originating in the core Islamic countries. 

Among the many different kinds of page layout and combinations of langua¬ 
ges and scripts adopted in the khatt-i sini manuscripts including xiaojing texts (or 
glosses), we can find five main types: 

1. xiaojing in conjunction with translations in Chinese characters added between 
lines of Arabic poetry quotations ( Lama‘at); 

2. translations written obliquely as isolated glosses, up against selected expres¬ 
sions of a reference text (the grammar book called Minhaj); 

3. xiaojing figuring on a par with an Arabic reference text by covering the larger 
part of the page ( Tahawur al-kalam); 

4. substitution of Arabic text by xiaojing, which is accompanied by Chinese 
characters written between the lines that reproduce the xiaojing vernacular 
{'Aqida)\ and 

5. a widely used format adopted in Qur’an translations (and other texts), which 
features Arabic followed by xiaojing translations in the same line. 

Three different types can be noticed in Arabic-Chinese glossaries: xiaojing trans¬ 
lations written above the word to be explained (similar to type 2) or inscribed in 
the line following the lemmas which are listed in two or more columns (similar to 
type 5), or xiaojing translations additionally accompanied by characters, which is 
what the scribe who penned a copy of the Lughat Mukhtasar al-ma ‘ani did. 

On a micro-level, it has been observed that xiaojing poems included in the 
Arabic manuscripts imitate the classical models of Arabic poetry quotation in 
terms of their conventions: the Chinese poems are introduced by the conventio¬ 
nal Arabic formula anshada, ‘he recited’, and the verses that are fully vocalised 
are separated by graphical elements such as circles inserted individually or in 
clusters (see above pp. 187 and 192). 

Classical Arabic has influenced the Chinese vernacular inscribed in khatt-i 
sini manuscripts directly (one example being the use of the word khalq in a 
xiaojing catechism where the translation written in characters offers a Chinese 
equivalent) and through mediation of Persian, which Hui Muslims consider their 
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ancestral language. 55 The contribution of the Persian writing system to Sino-Ar- 
abic manuscript culture can be seen in many ways: letters such as cim and zay 
used in xiaojing to transcribe the Chinese consonant r- as in ren A [‘man’] have 
been borrowed from the Persian alphabet. Persian literary works are studied at 
the madrasas (although nowadays the influence of Persian is receding due to the 
increasing weight of the Arabic language) and they are quoted in the texts intro¬ 
duced in this study. The oldest of the group of texts narrating the hagiography of 
Ma Mingxin, the Rashahat, was partly written in Persian. Xiaojing translations 
have not only been added to Arabic texts, but also to works in the Persian lan¬ 
guage such as the Minhaj. Persian insertions have even served to structure the 
text of some Sino-Arabic works (e.g. Munajat) and, finally, Persian words have 
been used to explain Arabic lemmas in the manuscripts of the glossaries. 

Traditional cultural practices like dictation of text by the mudarris during a 
lesson were followed by the Chinese madrasas, but the amenability of xiaojing to 
misinterpretation (as shown in the funny story of fiqh related among Hui students 
about the confusion of wine and chicken oil due to shortcomings in the Arabic 
transcription of Chinese) may have impeded its full adoption. The hybrid forms 
of xiaojing graphemes and characters written in manuscripts by students (pro¬ 
bably during lectures) also indicates that people found it awkward to write xiao¬ 
jing. Chinese students were constantly confronted with having to make a choice 
between characters and xiaojing, and the aversion to writing characters because 
of religious reservations ascribed to earlier generations of Muslims had long 
been overcome, hence the presence of characters in the margins of khatt-i sini 
manuscripts and the ongoing publication of Chinese translations of Arabic texts. 

As for the role of religion, specific sacred Islamic texts such as the Qur’an and 
the liturgical texts recited daily within the Jahriyya Sufi order have all played a role in 
the standardisation of the script adopted for literary transmission. Unlike works of 
tafsir or biographies (even including Jahri hagiography), these texts - Mukhammas 
(Qasidat al-Burda), Mada ’ih, Mawlid al-nabi and Munajat - have required the pious 
scribe to choose a calligraphically valuable, traditional khatt-i sini style showing 
features which are mostly absent from copies of manuscripts from other genres. 

In the liturgical texts just mentioned, standardisation in handwriting and 
choice of format has been achieved in mutual dependency. The texts have all 
been written in the bold, hieratical variant of khatt-i sini, mostly in a layout of five 
lines a page (the two-volume Mawlid edition has seven lines), 56 which, at least in 


55 The numerous borrowings from Arabic (and Persian) have been listed in various glossaries 
(e.g. Yang 1996, 87-122). 

56 A manuscript of the Kitab al-Mawlid on the celebration of the Prophet’s birth, copied by 
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China, has also been the preferred format of the partial Qur’an texts (juz ’) pub¬ 
lished in Yunnan province as wood or stone prints. However, the five-line format 
is also a natural choice because in the Mukhammas the text consists of units of 
five hemistichs. The liturgical texts are left clear of Chinese glosses despite the 
large empty spaces separating the lines, but publications of Mukhammas facsimi¬ 
les with a Chinese translation printed or handwritten on opposite pages are very 
popular among Muslims in the region. Part of the traditional format of the Muk¬ 
hammas is also the addition of a Persian poetical translation written in two-and- 
a-half lines at the bottom of each page. In a modern facsimile publication of the 
Mada’ih (Hong Kong 2005), however, a Chinese translation written in characters 
with a five-line stanza is provided at the bottom of each page. 
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Fig. 9: A printed text of the ‘Aqidat al-islam which includes the transliteration tables. 


the above-mentioned Mahmud ibn al-Khabir and published in facsimile in 1999. It is stated on 
p. 127 of the tiwen (‘treatise’?) part of the book that the Kitab al-Mawlid was bequeathed 
by Ahmad ‘Aqlla al-Makkl [in Arabia] to the Sufi Abu al-Futuh Ma Laichi JS?|£M (d. 1766), Ma 
Mingxin’s colleague. 
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Attempts have been made to standardise the orthography of xiaojing texts 
and paratexts in manuscript production. A case in point is the xiaojing syllabary 
appended, with a didactic motive, to the Xining edition of Ma Tianmin’s much- 
studied ‘Aqida (Fig. 9). 

Processes of standardisation have been at work, as is indicated by the regular 
presence of vowel signs added to the transcriptions in the manuscripts consulted 
and by the doubling of fatha, kasra and damma or the redundant fatha, all of 
which are characteristic features of xiaojing. The rule that the letter jim is written 
in the initial form even when it occurs in isolation is generally observed. 

Even so, there are still some major inconsistencies in the graphic representa¬ 
tion of a number of Chinese sounds: 57 

a. the representation of the initial non-phonemic consonant sound in words 
such as an, ‘peace’, or er, ‘child’, is inconsistent, for example [ghayn and 'ayn 
are used interchangeably; in the proverb ‘building the covered way...’ in K. 
al-Jahri, p. 113, an is rendered by ghayn + doubled fatha); 

b. consonant x- (a fusion of the two MC initials s- and h-): in the above proverb, 
xiu, ‘build’, is written with sin as (as it may be traced back to an older s-); 
sin also appears in xi, ‘wash’; however, sin has also been used to transcribe 
s- in (unvoiced) si , ‘four’; in xian, ‘appear’, x- is transcribed by h , the same 
as in xiong, ‘breast’. 

c. Finally, in the toponym ‘Linxia’, x- has been written using the letter shin. The 
j- consonant (again representing two MC initials) is usually written as kaf 
with three dots above it, but some aberrations have been observed as well: 
j- in xiaojing has been written with a minuscule letter ta’ placed above kaf; 
conversely, kaf with three dots, vocalised with kasra, , has also been used 
to represent d- in dian, ‘lightning’, and j- in jing (apparently pronounced as 
din in the Linxia dialect) has also been written as dal (-). 


5 7 Many such inconsistences may be accounted for by the fact that xiaojing represents a dialect 
variant which retains features of MC. 
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Nowadays, Arabic text editions accompanied by xiaojing in any of the variations 
shown above compete with formats where translations in characters are printed 
in the same line as the Arabic wording, and the impression is that at least in 
printed publications Arabic followed by Chinese translations this will eventu¬ 
ally become the standard. The fact that Arabic sacred text has occasionally been 
transcribed by Chinese characters used exclusively for their phonetic value indi¬ 
cates that even transliterations written in characters, considered more elegant by 
some, can prevail over xiaojing , 58 
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Fig. 10: The Da'awat al-muslimin (a collection of prayers). 


58 One example is a collection of prayers entitled Da'awat al-muslimm, published as a facsimile 
manuscript in 1982 by the Islamic Association of Beijing (Beijing shi yisilan jiao xiehui dtitff \iffi 
WtMW®HO (Fig. 10). The collection also includes a Chinese translation written beneath the line, 
from the left to the right (the phonetic transcription written above the line runs from right to left). 
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Jan van der Putten 

A Collection of Unstandardised 
Consistencies? The Use of Jawi Script in 
a Few Early Malay Manuscripts from the 
Moluccas 

Abstract: Jawi, a form of Arabic script in use in insular Southeast Asia, has been 
employed for writing Malay manuscripts for at least eight centuries (13 th -20 th cen¬ 
turies). Introduced and disseminated through Islamic canonical texts such as the 
Qur’an, it became the principal means of communication and knowledge pro¬ 
duction over the years and across the region. Studies of Malay palaeography and 
orthography have been rare and far between, and are mainly limited to colonial 
scholars designing the rules to be taught in the educational system and a number 
of surveys that record the orthography of certain texts. 

Inspired by a recent publication which concisely discusses the palaeography and 
orthography of 60 dated or datable Malay manuscripts, the following chapter will 
expand on some of the observations made by the scholars. I will discuss a number 
of 17 ,h -19 lh -century manuscripts originating from the Moluccas and compare some 
characteristics of their orthography with those of the Hikayat Tanah Hitu, a Malay 
historical narrative compiled in the mid-17"’ century. In this exploration of ortho¬ 
graphic consistencies and peculiarities in Malay manuscripts originating from one 
region, I argue that the ambiguity of Jawi was a welcome characteristic that could 
accommodate the needs of multilingual communities found in the Malay world. 


1 Introduction 

Throughout history, the writing of Malay has made use of a variety of scripts in 
concordance with the prevailing influence of a major belief system or hegemonic 
colonial regime: Hindu-Buddhist traditions made use of a southern Brahmi script 
that spawned a number of vernacular forms; Islam brought Semitic languages 
and the Arabic script; and European imperialism and ‘modernity’ brought Latin 
script, which would eventually dominate the written and printed communica¬ 
tions in insular Southeast Asia from the 19 th century onwards, even though Chris¬ 
tianity never became the dominating religion of the region. Insular Southeast 
Asia, where the Malay language was developed, spread and is now widely used. 


3 Open Access. © 2019 Jan van der Putten, published by De Gruyter. This work is licensed 

under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110639063-009 
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is predominantly Islamic except for the northern part of the Philippines and some 
areas in eastern Indonesia. It is generally assumed that Islam gained its first firm 
footing in the 13 th century with the conversion of the ruler of Samudra-Pasai at the 
northern tip of Sumatra. Other polities in the islands soon followed, and a Muslim 
trading network in the region gradually developed, renewing ties and connecting 
with larger networks across the Indian Ocean. 

Islam not only introduced its canonical texts with their Arabic language and 
script, it also informed and enhanced contacts with other Muslim traditions such 
as Gujarati and Persian. These newly introduced traditions met with existing 
ways of disseminating texts and knowledge, with which they mixed in different 
ways according to local circumstances in very complex and multifarious proces¬ 
ses that took place over a long period of time. In the multi-linguistic environ¬ 
ment of the Indonesian archipelago, Malay was one of the vernaculars that had 
been used for a long time to communicate official matters of state and religious 
affairs and was apparently sufficiently known in 13 th -century North Sumatra to 
become the vehicle of transmission for the new religion. Although there are a 
few examples of Islamic texts represented in locally derived Brahmi scripts, the 
Arabic script seems to have soon taken over the dominant role in disseminating 
religious knowledge and also other, more secular topics such as business affairs. 

Expanding Muslim trading networks greatly stimulated the dissemination 
of Islam, its texts and the languages of transmission, Arabic, Persian and, most 
extensively, Malay. Due to geopolitical circumstances and reasons, the nodes of this 
network dispersed over a myriad of islands, all with their own local interests and 
traditions. Later, other Asian and European trading partners joined, and eventu¬ 
ally disturbed the balance within the network. Therefore, it is difficult if not impos¬ 
sible to discern a single and strong Arabic or Malay Islamic tradition that would 
encompass the whole region and spur further developments. Throughout history, 
there have been powerful centres that have extended great influence over a longer 
period of time in political and cultural terms, such as Melaka in the 15 th century, 
Aceh and Banten in the 16 th until the mid-17 th centuries, and Makassar in the 17 th 
century. However, the impact these power centres impressed upon a more gene¬ 
rally defined Malay culture was relatively short-lived and therefore limited in scope. 
Of course, the presence and gradual expansion of the influence of the Dutch East 
Indies Trading Company (Verenigde Oost-Indische Compagnie or VOC) also thwar¬ 
ted the establishment and continuation of a firm Malay Islamic tradition. 

Although we perhaps cannot speak of a strong, single Malay tradition, the 
Arabic script which seems to have become current in insular Southeast Asia in the 
16 th and 17 th centuries made it possible for narratives to be circulated within a cul¬ 
tural realm heterogeneously and temporarily delineated by a combination of trade 
agreements, religion and language. The Arabic script, developed to represent the 
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Semitic language in close connection with the writing down and distribution of 
the holy Qur’an and other liturgical and theological texts, was adjusted to enable 
the translation of these texts into languages of other language families. Probably 
based on a familiarity with similar adjustments made for Persian and Indian langu¬ 
ages, five consonants were added for writing Malay (<p>: ■-*, <c>: g, <ng>: <ny>: 

<j and <g>: -S), while in the Javanese system an extra two characters were added 
to represent the retroflex stops /t/and /c|/. These modifications were realised by 
adding diacritical dots to the existing forms of the Arabic characters, e.g. the Malay 
voiceless palatal stop /tf/ is represented by the addition of three dots below the 
Arabic £ <h>. The set of Arabic characters to write Javanese is called Pegon. It is 
used next to locally developed Brahmi-derived as well as Latin scripts, while Jawi is 
the common name to refer to the alphabet used to write texts in Malay and several 
other regional languages. Palaeographical and orthographical studies of Jawi as 
used in manuscripts are rare, and were mostly carried out by colonial officials and 
missionaries who made it their business to disparage Jawi spelling. They argued 
that it was defective in terms of not representing the full range of vowels and, pro¬ 
bably more importantly for most of them, that it was connected to Islam. Inspired 
by such considerations, they made efforts to ‘force Jawi spelling into a Procrustean 
bed of presumed rules and standards’ (Kratz 2002,22). 

On the other hand, Ulrich Kratz (2002) also contends that it may in fact be 
this ‘defective’ nature of the Jawi spelling not fully indicating the vowels of a 
word which allows for a variation in pronunciation and writing, a characteristic 
well appreciated in a multilingual environment and conducive to joining people 
together in a common cultural realm. In a recent publication, issued as a tribute 
to Ulrich Kratz’s scholarship on Malay manuscript studies, Annabel Teh Gallop 
refers to this particularity in Malay spelling as a ‘tradition of inconsistency’ which 
may provide a good way of describing the impressions one gets upon reading 
Malay manuscripts (Gallop 2015, 29). 

In this chapter I would like to follow up on some of the points Kratz and Gallop 
have indicated in their recent publications and explore a few characteristics of 
the Jawi spelling as found in a small number of manuscripts originating from the 
Moluccas in the 17 th century. I must emphasise here that this is a first preliminary 
exploration of this kind, building on what has been displayed in A Jawi Source- 
book for the Study of Malay Palaeography and Orthography (Gallop 2015), which 
contains a concise survey of the writing in 60 dated Malay manuscripts spanning 
almost 350 years. In the next section, I will concisely discuss some earlier studies 
about Jawi spelling before going on to introduce and contextualise the manuscripts 
I explore in the remainder of this paper. As will become clear, the state of the art 
does not allow me to make any sweeping conclusions about standardisation of the 
Jawi script or the development of the orthography in Malay manuscripts. 
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2 Studies in Malay orthography 

With reference to the recent studies by the above-mentioned scholars of Jawi 
script, earlier studies about Jawi orthography in general have put much effort into 
determining how ill-equipped the modified Arabic script was for the representa¬ 
tion of Malay sounds. Moreover, they sought to prescribe certain rules that would 
improve the deplorable state of affairs and make the Malays write their language 
in a proper manner (see Kratz 2002 and Gallop 2015). In an elaborate survey of 
the history of the scripts used for writing Malay, Hashim Musa (1997) understan¬ 
dably focuses on a description of the Jawi script which to a certain extent is still 
being used and propagated in certain Malaysian quarters. His book presents long 
lists of spelling particularities that were gleaned from a wide array of Malay texts, 
ranging from a number of early inscriptions on tombstones and the earliest evi¬ 
dence of Jawi script in the region on the Trengganu Stone (dated 1303 CE) to a 
selection of manuscripts from the 16 th up to the 19 th century, to the spelling of a 
few books, treatises and news items in periodicals published in Malaysia in the 
20 th century. In a concise discussion of earlier studies on this topic, the author 
simply reverberates a well-known proposition made by the British missionary- 
cum-scholar William Girdlestone Shellabear (1901) that the relatively fixed stan¬ 
dard of the writing in early Malay manuscripts was due to the recent transfer 
of the script from Arab teachers to Malay pupils. This standard for the writing 
of Malay is supposed to have gradually faded in subsequent periods as more 
local particularities seeped into the writing system. These local particularities 
mainly pertain to the representation of a few consonants, such as the opposition 
between / and p, g and glottal stop, and especially the writing of vowels in the 
words. Hashim Musa also refers to a study by the Korean Kang Kyoung Seok, who 
proposed the thesis that the Jawi spelling had gone through three stages in its 
development for the purpose of representing the Malay language. The first phase 
is characterised by a faithful rendering of Malay sounds by applying an Arabic 
standard with a full set of diacritics, including vowel points. In the second phase, 
this standard was gradually left behind and vowel points were replaced by a set 
of semi-vowels comprising alif, waw and yd ’. These represent the vowel pairs a/a, 
o/u and i/e (e, e), which were inserted in the penultimate syllable of normal two- 
or three-syllable words in Malay which contains the word stress. 1 The third phase 
was characterised by the inclusion of these semi-vowels also in the ultimate syl¬ 
lable, a feature which is visible in the standard Jawi spelling systems developed 
in modern-day Malaysia (Hashim 1997, 78-80, Kratz 2002,23). 


1 For a discussion about the alif used to represent /a/, see below. 
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As a general rule of thumb, the most obvious characteristic of the second 
phase often found in Malay manuscripts produced between the 16 th and 19 th cen¬ 
turies is the spelling of common words such as jadi ‘to be, become’ as A*. <jad>, 
and orang ‘human being’ as jj' <awrng>. 2 However, this feature of not indicat¬ 
ing vowels in certain positions and the use of a set of ambivalent indicators (i.e. 
the use of semi-vowels), or even the total absence of any indication of vowels, 
which regularly occurs in the spelling of certain words, 3 charges the written texts 
with a certain degree of ambiguity which has led to severe criticism of past and 
present scholars and other observers imbued by a print-literate-inspired longing 
for unambiguousness of written texts (cf. Milroy and Milroy 1991, 64-69). Such 
criticism, of course, tacitly ignores similar inconsistencies in spelling and other 
linguistic peculiarities which were common in European written texts from an 
earlier period when mass education was not yet established and the standardi¬ 
sation of language usages was much less developed than it is now. Written com¬ 
munication is usually considered superior to speech in Western civilisations and 
by consequence applied as a yardstick to measure cultures in other parts of the 
world. However, two caveats must be brought forward with respect to ‘meas¬ 
uring’ Malay writing in accordance with such a ‘universal’ truism. Besides the 
fact that we are dealing with texts which were written down during a time when 
virtually no structured formal education existed in the Malay world, it may also 
be conceived that there are languages in which a certain degree of ambiguity in 
texts may be more appreciated than a formal, unambiguous form of communi¬ 
cation. Such a consideration was proposed by Geoffrey Benjamin with regard 
to the informality and situatedness of colloquial Malay, which cherishes a high 
degree of ambiguity and invites interlocutors to join the ‘language play’. This 
informal or condensed variety of the language shares certain important char¬ 
acteristics with the Malay used in narratives recited for local audiences, such 
as a-historicity, a lack of interest in the relations between events, and a down¬ 
playing of individuality. Benjamin describes this condensed form of Malay as a 
system in constant flux, where we are faced with contextual and fleeting shifts 
in cultural meanings projected by native speakers on the linguistic forms being 
used (Benjamin 1993, 355-56). 


2 Words starting with the vowels i/e or u/o are introduced with the alif-ya’ and alif-waw res¬ 
pectively. 

3 This commonly leads to consonant frameworks which can accommodate different vocalisa¬ 
tions. The framework 3^ (s-n-t-q), for instance, is interpreted in Klinkert’s Malay-Dutch dictio¬ 
nary (1947, 603-4) as santak (‘to thump’), ssritak (‘to puli’), santuk (‘sleepy; to accidentally hit 
something’), suntuk (‘being obstructed’), sintok (‘tree species: cinnamomum sintoc’), sintik 
(‘type of small oyster’). 
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A similar proposition is made by Ulrich Kratz, who calls the major perceived 
shortcoming of not indicating all vowels of the words in what is to a certain extent 
a received Jawi spelling a ‘major strength’. Part of his attractive contention is that 
Jawi had established a ‘shared written link’ among regions and dialects (Kratz 
2002, 23). This kind of written link seems to have existed in particular between 
Malay and Minangkabau cultural realms in Sumatra, whose languages share a 
large number of lexical items but differ in their pronunciation. In the case of such 
similar languages it seems quite straightforward to conceive that the inherent 
ambiguity of linguistic expressions in the writing system of these languages was 
considered to be a positive feature, as it marks a more open venue for commu¬ 
nication than a form rigidly controlled by non-negotiable rules for the spelling 
of particular words. The consonant framework (s-n-j), for instance, allows 
speakers of these languages to realise a different pronunciation, [sanja] and 
[sanjo] for Malay and Minangkabau respectively, which in the oral/aural mode 
of communication may not be readily recognised by them as one and the same 
word. However, its visual representation in written form is identical in Arabic 
script and therefore the meaning is readily understandable for speakers of both 
languages as ‘dusk’. 

A second caveat we need to make in this respect concerns the disregard of a 
differentiation of certain textual genres and the possible existence of more stable 
local traditions that produced Malay texts in which a higher degree of consistency 
was applied than general statements allow for. As the accuracy of Malay scribes’ 
copying of doctrinal Islamic texts is generally considered higher than in more 
secular texts (see e.g. Voorhoeve 1964), it seems likely that the former textual 
genre will show a higher degree of consistency in spelling in comparison to the 
latter. We should also be aware that, although Malay was the language of state 
and religious learning from the 16 th century onwards in many parts of the archi¬ 
pelago and beyond, people used different local languages in daily conversation 
in polities such as Aceh, Makassar, Bima and the Moluccas. These distinct polities 
were nodes in Islamic trading and linked scholarly networks; here local traditions 
may have developed their own particularities in terms of textual layout, shape 
and spelling in manuscripts produced at the courts of the rulers and other centres 
of learning. The study of these kinds of regional traditions found in clusters of 
related and datable documents may be a more fruitful approach, leading to a 
certain understanding about developments in the palaeography and orthography 
of Jawi writing, as Annabel Teh Gallop convincingly proposes in her recent publi¬ 
cation. As preliminary examples of such traditions she mentions a few characte¬ 
ristics of an Acehnese religious book hand which was current in religious writings 
in the 18 th and 19 th century, and a Moluccan chancery hand found in documents 
from the royal courts in the Moluccas (Gallop 2015, 34-36). It is this notion of a 
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local tradition in the production of manuscripts I will engage in here and explore 
to what extent it can be applied to a number of manuscripts that originate from 
the Moluccas and the royal court of Makassar on the island of Sulawesi. 


3 Malay documents from the Moluccas and 
Makassar 

Although it was already widely used as a trading language in eastern parts of 
the archipelago at an early stage, Malay was not the only or the most dominating 
language of the region. In the 17 th century, Malay had to compete with Portuguese, 
Spanish and Arabic as language of diplomacy; it was used in combination with 
Arabic in the dissemination of religious knowledge; and it also served as contact 
language next to a welter of local languages in daily conversations and trade 
negotiations. Because of the invaluable spices grown in the Moluccan islands, for 
centuries the region had been a trading hub which attracted a host of merchants, 
priests and soldiers from the archipelago, such as Javanese, Makassarese and 
Butonese, and from other parts of the world, including Arabs, Chinese, Indians 
and Europeans. Quite a few minor local rulers tried to establish their own polity 
to surf the prosperous wave of increasingly globalised trading networks, which 
inevitably led to harsh competition and full-scale wars between local contenders 
who frequently called in the assistance of outside forces to decide the conflicts 
with their neighbours and other contenders. In the 16 th century the islands were 
alternately dominated by the northern polities of Ternate and Tidore. The former 
first secured Portuguese support for their political agenda, and later the Dutch 
helped the rulers to consolidate their power and prosperity into the 17 th century, 
ironically also making Islam the dominant religion in the islands, despite Por¬ 
tuguese and Dutch proselytising activities. Not long after its arrival, the Dutch 
VOC implemented a gruesome monopoly on the cultivation and trade of cloves 
and nutmeg by forcefully expelling all foreign competitors and playing regional 
powers off against each other, or killing them if they failed to comply with the 
new rules. 

It is in this highly volatile context that the documents I will discuss are set. The 
first and most important document is the Tale ofHitu (Hikayat Tanah Hitu). Hitu 
was a polity which had united 30 settlements on the north coast of Ambon, and it 
was jointly ruled by four prominent families. Imam Sifar Rijali, a learned member 
of one of these four ruling families, is reported to have written Hikayat Tanah 
Hitu while he was staying at the viceroy’s court of Gowa in Makassar. Because of 
severe reductions in the clove production imposed by the Dutch VOC, the govern- 
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ment of Hitu had declared a war that would rage in the central Moluccan islands 
with intervals during the 1640s and 1650s. Rijali had taken refuge in Makassar to 
ask the assistance of the Makassarese at Gowa as the main competitors of the Ter- 
natean overlords of Hitu. The rulers of Ternate were too much inclined to go along 
with Dutch rule. Subsequently, in the 1650s one of the factions of the Ternatean 
ruling family rebelled against this pro-Dutch stance of the sultan and open figh¬ 
ting broke out on the islands under the leadership of Majira, a distant member 
of the ruling family of Ternate. As one of the Hituese leaders, Rijali intended to 
restore the authority of his government. It was perhaps to provide documentation 
of the ongoing war to support his appeal or at the personal request of the enligh¬ 
tened viceroy of Makassar, Karaeng Pattingalloang, that Rijali compiled a prose 
narrative about the events and their historical context. 4 

Hikayat Tanah Hitu more or less chronologically deals with the early state 
formation in Hitu (1500-38), wars the Hituese waged on the Portuguese (1538- 
1605), the monopolisation of the clove trade by the VOC (1605-43), and Hituese 
armed opposition against the abolition of the Hituese government (1643-46). 
After writing the tale during his exile in Makassar, Rijali went back to the Moluc¬ 
can islands in 1653, where a copy of the work was made in the 1650s for another 
member of his family. It is this copy that was passed on into the hands of the 
well-known German-born merchant, botanist and historian Georg Eberhard 
Rumphius, who spent most of his life in Ambon. This copy then, possibly by way 
of the Dutch Reverend Valentijn, eventually ended up in Leiden. Both Rumphius 
and Valentijn, well-known contemporaneous commentators on Moluccan affairs 
and important contributors to its historiography, used the tale to mine informa¬ 
tion for the writing of their own histories of the region. In his monumental Oud en 
Nieuw Oost Indien (1724-26) Valentijn mentioned the tale as one of the texts being 
circulated within Muslim quarters. He interpreted this as an indication of the 
scholarship of the Muslim part of the Moluccan population, whose knowledge 
and ability to speak and write Malay he considered much better developed than 
the level of proficiency of the Christians under his tutelage. He was convinced 
that this better command of Malay by Muslims was due to the fact that they pos¬ 
sessed a number of Malay writings, which they lent to each other to read or copy. 
Among the other texts he found in Ambon, Valentijn listed works which are now 
considered as classics in Malay traditional writing, including Sulalat al-Salatin 
or Sejarah Melayu, containing historical tales focusing on Malacca; Hikayat Amir 


4 For a full account of the historical context and a description of the manuscript containing this 
text, see Stravers, Van Fraassen, and van der Putten 2004. The unique manuscript is preserved 
under Cod. Or. 5448 in Leiden University Library. 
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Hamzah, a tale about Prophet Muhammad’s uncle; and Hikayat Nabi Muhammad 
and Hikayat Nur Muhammad, two tales about the Prophet Muhammad. 

Hikayat Tanah Hitu is one of the very few relatively old text examples we have 
from eastern Indonesia, which adds to its importance for manuscript studies in 
general and the study of the development of the Jawi script in particular. The 
manuscript originally consisted of 53 folded folio pages which seem to have been 
bound with thread in quires. Since most of these pages are torn in the fold, the 
manuscript now almost exclusively comprises loose pages, measuring about 32 
by 20 centimeters. The beginning of the tale is missing and the pages have been 
numbered using Arabic numerals from 2 up to 107 by one of the later owners 
or users. The first 80 pages each contain 17 lines in orderly Arabic characters, 
whereas the script in the remaining pages is less regular and these pages contain 
15 to 17 lines. The manuscript held at Leiden University Library is still clearly 
legible, but the paper is rather worn and tattered at the edges. Even though the 
manuscript was probably originally bound, there is no sign of any covers or end¬ 
papers. The first and last pages are provisionally repaired with Japanese tissue 
and quite a few pages have greasy stains. 

The extant text starts with the end of what must be the initial episode of 
the tale, which indicates that not many pages have gone missing from this 
manuscript. A few doodles that embellish the pages which now serve as first and 
last page, indicating the beginning and end of the extant manuscript, suggest 
that any covers and the beginning of the text might have been missing already by 
the time the copy was made or passed on into Dutch hands. The remaining part 
is complete, since the last page of the manuscript contains the end of the text. 

The paper used for the copy of Rijali’s text reveals some indications about 
the age of the manuscript, as the last 14 pages contain a watermark (a fool’s cap) 
and a countermark (consisting of the initials for VOC). The typical fool’s cap 
watermark indicates that the paper dates from the second half of the 17 th century, 
whereas the countermark shows that the paper was ordered by the VOC. 5 There¬ 
fore the manuscript may be dated to the second half of the 17 th century and is 
among the oldest Indonesian manuscripts extant. This is in contrast with most 
of the preserved Malay manuscripts which contain copies of texts made during 
the 19 th century for successive generations of owners, as climatic conditions in 
the tropics render paper a highly perishable medium for the distribution of texts. 


5 The paper shows a fool’s cap with a seven-pointed collar of a kind which is likely to have been 
manufactured in the second half of the 17 ttl century, while fool’s caps with five-pointed collars are 
generally from the first half of that century (see Laurentius and Laurentius 2008, 2: vii). 
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Fig. V. First page of the Tale ofHitu (Hikayat Tanah Hitu ), Leiden University Library, Cod. Or. 5448. 
© Courtesy of the University Library Leiden. 


The Hikayat Tanah Hitu manuscript not only contains indications about its age, 
but also about its origin. In this respect a piece of calligraphy that appears twice 
in the manuscript deserves special attention: in the top margin of pages 77 and 78 
of the manuscript (shown in Fig. 2), we find the phrase Min Bulan Nustapi (written 
as <jL (> (m-n-b-l-n n-s-t-f-y) and meaning ‘belonging to Bulan Nusatapi’, 
the sobriquet of one of Rijali’s cousins). This may be interpreted as the hallmark 
or inscription of the original owner(s) of the manuscript; Nustapi or Nusatapi was 
the name of Rijali’s lineage. The manuscript also offers indications that it was a 
copy of an older one, perhaps even of the original text by Rijali. At certain inter¬ 
vals in the manuscript we find a word in the margin that has served as a sign for 
someone to indicate that the reading during a certain session had come to that 
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particular point in the text. Although quite common in the Islamic tradition of the 
Middle East, this Arabic word, balagh (jh; ‘reach, transmit, report’), is not a mark 
that appears frequently in Malay manuscripts. In this manuscript it seems to indi¬ 
cate the reading by an authoritative reader who compared the copy with the orig¬ 
inal and in the end gave his approval to the copy. This approval is found on page 
92, where the word sahh (^-=>; ‘authentic, acknowledged, legal’) is written in the 
margin. These marginal notes of balagh and sahh suggest that a member of the 
Nusatapi family carefully checked the manuscript, gave his approval to the copy 
and eventually inscribed the name of his family as an indication of ownership in 
the manuscript. 



Fig. 2: Inscription of the name of the original custodians of the manuscript; pages 77 and 78 of 
the Hikayat Tanah Hitu, Leiden University Library, Cod. Or. 5448. © Courtesy of the University 
Library Leiden. 


As indicated before, there are not many Malay documents that have been preser¬ 
ved from this period and region, but some scattered manuscripts may serve as 
extant material with which the orthography and some palaeographic characteris¬ 
tics of the handwritten Tale of Hitu can be usefully compared. These documents 
include a manuscript preserved in the Staatsbibliothek in Berlin which contains 
(1) a (fragment of an) undated letter from one of the Hituese leaders probably com¬ 
piled in connection with one of the other texts in the manuscript, namely (2) an 
agreement between the Ternatean Sultan Mandar Syah and the Dutch Governor- 
General of 1652; and (3) the text of an agreement between the Dutch and Sultan 
Hamza in Ternate from 1638 (Staatsbibliothek, Ms. Or. Fol 409, 1-3). 6 The other 
documents comprise a letter from a Ternatean leader in exile in Makassar, Kime- 


6 The text of the letter seems closely related to the problems surrounding the visit the Ternatean 
Sultan Hamza paid to Hitu and the consequences of the new political relations as imposed by 
the VOC in 1637 (see Stravers, Van Fraassen and van der Putten 2004, 61-62; 178-85). I surmise 
that the letter originally may have been an attachment of the agreement between Sultan Hamza 
and the VOC. 
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laha Salahak Abdul Kadir ibn Syahbuddin, to the British East India Company 
dated 23 May 1658 (Gallop and Arps 1991, 38), and two handwritten narratives 
copied in Ambon at the beginning of the 18 th century as presented in A Jawi Sour¬ 
cebook by Vladimir Braginsky (2015, 54-55; 62-63). As Annabel Teh Gallop notes, 
even a cursory comparison between these documents and two early-19 th -century 
letters from Ternate and Tidore she scrutinises for the same publication (Gallop 
2015, 82-85) already shows that the handwriting of these documents differs mar¬ 
kedly from the ‘wispy and spidery’ hand and specific form of some letters that 
are characteristic for the Moluccan chancery hand (Gallop 2015, 36). Below I will 
discuss a few palaeographic and orthographic characteristics of the writing in 
the copy of the Hikayat Tanah Hitu and indicate similarities with the other extant 
Malay documents mentioned above. 


4 Characteristics in the writing of the Tale of Hitu 

Taking a glance at the mise-en-page of the first page of the Hikayat Tanah Hitu 
text (Fig. 1), it is obvious that the text block takes a central position on the page 
with three wider and one smaller margin, the narrowest margin being on the side 
of the paper where the manuscript is bound. This layout is consistent throughout 
the manuscript, and we can see that the first page of the manuscript with the 
doodles in the left-hand, wider margin is the recto side while a possible title page 
(first recto) and start of the text (first verso) are missing (see Fig. 1). 

The writing is regular but not refined. The most obvious characteristic is 
the rather elongated, slightly slanted top stroke of the letter kaf ( k/g ) above its 
straight upright ‘trunk’. 7 While the tails of certain letters, particularly the ra’ and 
wdw, are often nicely rounded and elongated, the handwriting overall does seem 
to be quite common and does not show any distinct characteristics which would 
set the manuscript apart from the bulk of other Malay manuscripts. 

The orthography of some of the words is arguably more specific to this text or 
copyist, and to a certain degree also represents the conventions of the period and 


7 See Fig. 1, but it may be even more obvious on other pages, such as the page of the manuscript 
which was included as an illustration in van der Putten 2015, 51. The <g> can be distinguished 
from the <k> in the Jawi script by placing one or three dots above or below the character kaf (<-£). 
In Malay texts, these dots quite often are omitted, especially in common words such as juga 
(‘also’; spelled [j-w-l<]), but in Hikayat Tanah Hituthe opposite idiosyncrasy is applied by 
adding dots to the kaf in some words, even though these are normally spelled with a <k>, as for 
instance in the word koluarga (‘siblings, family’) spelled [g-w-l-w-r-g-a]. 
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place where the copy was made. What must be mentioned first and foremost here 
is the use of the tashdid or shaddah, a diacritical mark to strengthen and normally 
double the consonant in Arabic, which takes the form of small ‘w’ on top of the 
enhanced character. The borrowing of this diacritic in Jawi orthography has trig¬ 
gered quite a number of comments from scholars’ earlier studies of Malay texts, 
which are described and discussed by Russell Jones (2005). The most common 
explanation for the use of the tashdid in Jawi orthography is that it would indicate 
a schwa (a mid central vowel [a]) in the preceding syllable. As there is no specific 
diacritic in Arabic script for the schwa, most frequently referred to with the Java¬ 
nese name pepet in Malay studies, an alif may possibly be used to indicate this 
vowel. However, as mentioned above, the half-vowels alif, waw and ya\ used in 
Malay spelling to indicate the vowel, are normally included in the penultimate 
syllable which contains the word stress, whereas syllables containing a pepet 
most often are not stressed. The convention of the use of the tashdid in these 
cases seems logical, because geminating a consonant does seem to have an effect 
on the quality of the preceding vowel (cf. Khattab and Al-Tamimi 2008). As has 
been noted by several scholars, a similar method of geminating the following 
consonants to indicate a schwa is also found in Old Malay and especially Java¬ 
nese inscriptions from South Sumatra and Central Java, which were written in a 
script derived from the southern Brahmi script (Jones 2005, 281-2). Mahdi notes 
that for the Brahmi script used in Old Malay inscriptions, consonants are gemi¬ 
nated following a prefixal -r, such as in marvvahun (‘to rise’), while in root words 
it is only once found in the Old Malay inscriptions found in South Sumatra in the 
cognate for Malay bdtum (‘bamboo’, pattum in the inscription), but occurs more 
frequently in later inscriptions (see Mahdi 2005,187-8). 

Although logically we tend to look at Arabic phonology to describe the 
tashdid’s function and effect with regard to Malay orthography, most interestin¬ 
gly the indicated function of the diacritical mark for Malay is in agreement with 
James Collins’s comments about the historical development of a Malay dialect in 
eastern Kalimantan. In a concise overview about these dialects, he notes that the 
Proto-Malay pepet in Berau Malay has merged with /a/, and that the gemination 
of consonants following a penultimate syllable which originally contained *a is 
historically related to this merger (Collins 2006,39). 

Although he states that these characteristics are not shared by Kutai Malay, 
the major other dialect in the region, the examples Collins gives for the Malay 
dialect of Berau mirror the system of Jawi spelling as we find it in the Hikayat 
Tanah Hitu and other texts and may indicate a common characteristic in Austro- 
nesian languages in which consonants are geminated under influence of a schwa 
in a preceding syllable. 
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In discussions about the inclusion of the tashdid in Jawi manuscripts, its 
use is usually considered as being an indication of the relatively old age of the 
manuscript, but as Russell Jones (2005, 289) has noted, this interpretation is not 
absolute, as we can also find tashdids in 19 th -century manuscripts. This being the 
case, however, I think that a frequent use of this diacritical mark in original Malay 
words does represent a somewhat archaic tradition which can tell us something 
about the age of the particular manuscript. I would argue that this is also the 
case with regard to the manuscript under discussion here and that, to a certain 
degree, the use of the tashdid indicates a certain convention which was current in 
the Malay world, more specifically in the eastern archipelago, in the 17 th century. 
In the Hikayat Tanah Hitu, then, we find a rather frequent use of the tashdid on 
the following consonant after a pepet in a penultimate syllable of words such 
as bdnar (‘right, correct’, spelled [b-n w -r]), bdrlolahi (‘to fight’, spelled ks y 
[b-r-k-l w -a]) 8 , tdtak or mandtak (‘to slash’, spelled ^ [m-n-t w -‘]), mdnang (to win, 
spelled [m-n w -ng]), sri (honorific title in combination with sultan, spelled Sy- 
[s-r w -y]). This use of the tashdid occurs a few times in the other documents orig¬ 
inating from the same time frame and region, for example, tatak and bdsar with 
a tashdid on the second consonant is found in the fragment of an undated letter 
preserved in the Staatsbibliothek in Berlin (MS Or. Fol. 409-1, see Fig. 3), while in 
the Malay contract from the same collection of manuscripts (MS Or. Fol. 409-2,3), 
the tashdid only occurs systematically in the word dia (third person pronoun sin¬ 
gular, spelled Is- [d-y w ]), which was possibly done to distinguish the word from 
the preposition di (‘in, at’). The page of the manuscript copy of the Tale oflsma 
Yatim (Hikayat Isma Yatim) which illustrates Vladimir Braginsky’s concise notes 
on the same (Braginsky 2015,55) yields a better crop of tashdids: in the first seven 
lines we find these diacritical marks used in the words maplis (‘council’, spelled 
[m-j-l w -s]), sekalian (‘all’, spelled [s-k-1 w -y-n]) and duli (‘dust’, part 
of a formula referring to the king, spelled ck [d-l w ]). The latter spelling, which 
also occurs regularly in the Hikayat Tanah Hitu, may be connected to the San¬ 
skrit origin of the word, while the other two both may indicate the schwa in the 
preceding syllable. 


8 This spelling occurs a few times in the Hikayat Tanah Hitu, sometimes without a tashdid and 
all of them without a final yd’. The undated letter in the Staatsbibliothek contains twice the same 
word spelled with the final yd’ and without a tashdid (g[b-k-l-a-y], see Fig. 3, beginning of 
line 11 and 12). 
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Fig. 3: Fragment of the undated letter from Hitu, probably late 1630s, Staatsbibliothek Berlin, 
MS Or. foi. 409-1 10 . © Staatsbibliothek zu Berlin - PreuBischer Kulturbesitz, Orientabteiiung. 


The same page of the Tale oflsma Yatim also shows another characteristic which 
may be described as a spelling convention it shares with the writing of the Tale of 
Hitu. In line 4 of the page displayed in Braginsky’s description (Braginsky 2015, 
55), we find the word Jam's (‘sort’) spelled as (finis), where the insertion of 
the half-vowel in the penultimate syllable may be observed as indication for the 
vowel of the final syllable. The same spelling method of writing the vowel of the 
ultimate syllable in the penultimate syllable which contains a schwa occur in 
a few instances in Hikayat Tanah Hitu, in words such as /can's (‘dagger’, spelled 
<_>“[k-y-r-s]), bdlum (‘not yet’, spelled ^ji [b-w-l-m]) and Japun (‘Japan’, spelled 
» [j-w-p-n]). In a few instances in the Tale of Hitu we find pad (‘trunk, coffin’) 
spelled as Jd* [p-y-t w -y], in which the convention of writing the assimilated half¬ 
vowel is found in combination with the inclusion of the tashdid on the following 
consonant. This spelling convention is restricted to a few words only, and it is 
obscure what might have triggered it. It seems clear, however, that representing 
the schwa in Malay texts in the Moluccas caused some problems, possibly related 
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to specific traits of local languages, and Malay dialects of the region generally 
lack this phoneme in their phonological system. 9 

Besides these regularities which are to a certain extent shared with some other 
texts from the early period of Malay writings, Hikayat Tanah Hitu also shows some 
particularities which may be considered idiosyncrasies of the scribe, possibly 
induced by the spoken vernacular, while others are rather commonly found in Malay 
manuscripts. These spelling particularities concern a somewhat regular, albeit cer¬ 
tainly not consistent, omission of the voiceless glottal fricative /h/ in initial, medial 
and final positions, while in other instances the h is added in words which do not 
originally contain it. For instance, on the first page of the text in lines 6,7 and 8, we 
find respectively mdmbawah, dibawah ('to bring’) and labuan ('mooring’), which 
in a perhaps more common orthography would be spelled mombawa, dibawa and 
labuhan (see Fig. 1). Apart from this very common characteristic which is also found 
outside Old Malay manuscripts and the Malay language, we find a particular confu¬ 
sion of certain nasals and the omission of a glottal stop, most frequently before the 
suffix -lean is attached to the words. Some examples in the Tale of Hitu comprise 
buankan (‘to discard, exile’, instead of buangkan ), sampang (‘small boat’, instead 
of sampan), Seran (name of an island, instead of Seram), dinaikan (‘to rise, install’, 
instead of dinaikkan), enda (‘to want, will’, instead of hendak) and anaku (‘my 
child’, instead of anakku). Both these characteristics may have been influenced by 
a vernacular language or a local dialect of Malay. 

The use of vowel points in the Arabic script in the Tale of Hitu is mainly 
limited to the spelling of Arabic words and a few proper names, such as ja 
[f'-r--dy--ry L k- hw u -t--m a -n-] ifirdirik hutman: Frederik de Houtman). In a 
few other instances diacritics are provided to indicate the exact spelling of the 
word, which may be due to a lack of familiarity with the word on the part of the 
writer or copyist or due to his intention to highlight the word for another reason. 

If the spelling may be considered rather inconsistent at times in this 
manuscript, the opposite can be said about the morphology, which quite closely 
follows common practice of most of the texts comprising the bulk of extant clas¬ 
sical Malay narratives. 11 One of the basic characteristics of Malay morphology of 
transitive verbs is the assignment of roles to agent and object through the use of 


9 The local Malay dialect, Ambonese Malay, does not have a schwa (Collins 1980, 18), while in 
the local Austronesian languages, the schwa was changed into other vowel sounds (see Strese- 
mann 1927, 95-100). 

10 I am very grateful to Annabel Teh Gallop for providing me with the photographs of the ma¬ 
nuscript held in Berlin. For a description and illustrations of these documents, see Wieringa and 
Hanstein 2015, 62-65. 

11 In this category of traditional Malay narratives I do not include doctrinal Islamic treatises, 
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a specific form of the verb: when the action is viewed from the agent’s perspec¬ 
tive, the verb will be prefixed with ma- and a nasal which is homorganic with the 
initial sound of the verbal root. The nasal will precede the initial voiced sound of 
the root or will replace the initial sound if it is voiceless. For instance, the root 
buang will change into mdmbuang (‘throw away’), while panggil will transform 
into mdmanggil (‘to call’). It has been noticed that this general rule of prefixing 
voiced sounds is relatively new and that the occurrence of deviating forms in 
texts is an indication of their age (see Jones 2005). In older texts, we occasionally 
find forms such as mamunuh instead of mamhunuh (‘to kill’) and rridmuat instead 
of mdmbuat (‘to make’), but the form manangar (and not mandangar, ‘to hear’) is 
quite persistent and commonly found in texts from the 19 th century as well (for an 
early example see Braginsky 2015,55, line 2). 

The text of Hikayat Tanah Hitu somewhat consistently follows this general rule, 
which forms an indication that the author and/or copyist was well acquainted with 
the rules of the formal written register of this language. In prefixing the active verb 
marker ma N-, hardly any deviating forms emerge compared to common practices. 
The text furthermore frequently and consistently uses the morphological possibilities 
available in Malay grammar to indicate a reciprocal action. In traditional Malay texts, 
the most obvious instance for this form is in the frequent war or fighting scenes in 
which the adversaries shoot, stab, hack or curse at each other. This is also the case in 
Hikayat Tanah Hitu, which contains forms such as samng-manyamng (‘to attack each 
other’), alah-mengalah (‘to defeat each other’) and tembak-manembak (‘shoot at each 
other’) as examples in which the second part of the reduplication is affixed while the 
first part consists of the root of the verb. 12 Another way to indicate reciprocity is by 
using the circumfix bar-...-an with possible reduplication of the root, which we find in 
examples such as barjanji-janjian (‘promise each other’), bartikam-tikaman ('to stab 
at each other’) and barsumpah-sumpahan (‘make a vow to each other’). 

Only in a few single instances do we find examples of more archaic gram¬ 
matical forms, such as dipapatutan (‘to put in order’, modern Indonesian 
dipatutkan ) and dibabohonkan (‘to lie about something’, modern Indonesian 
dibohongi). 13 These forms contain a partly reduplicated root form, which is 
extinct in modern standardised Malay. 


which seem to follow other standards heavily influenced by the Arabic originals they were trans¬ 
lated from or based on. 

12 Sarang-manyarang is spelled with alif in the penultimate syllable, while tembak-menembak 
is spelled with a final -h (») instead of a normal qaf{ J). 

13 Again in the Hikayat Isma Yatim, copied in Ambon, we find a similar irregular form, 
baparsambahkan (‘to present’; Braginsky 2015, 55, line 11), while a few lines further down we can 
find the regular passive form of the word in diparsambahkan (ibid., line 17). 
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5 Conclusion 

I need to emphasise that the limited scope of this paper only allows for a perfunc¬ 
tory and preliminary discussion of a topic as broad as the use of Jawi spelling in 
Malay manuscripts during a period of over three centuries and a distribution over 
such a vast region. I have given a short survey of points brought forward in previ¬ 
ous studies and described some palaeographic and orthographic characteristics 
of the writing found in a manuscript copied in the mid-17 th century in Ambon, 
which I compared with the writing in a few documents originating from the same 
period and region. 

Older discussions about Jawi orthography mainly deal with the ways in which 
vowels are represented, since Arabic usually indicates only three long vowels. I 
have given specific attention to the tashdid, frequently occurring in the text of the 
Hikayat Tanah Hitu and in some of the other extant Malay documents, demonst¬ 
rating a usage which may be in agreement with certain historical developments 
in geminating consonants and a merger of the schwa into /a/, as noted by James 
Collins for Berau Malay. 

Furthermore, the inconsistencies in the spelling of words I have touched on 
may reflect phonological characteristics of vernaculars and local dialects which 
are accommodated by the Jawi script, thereby providing a written link bridging 
the different repertoires of communication. It does not seem too surprising that 
the majority of correspondences in the use of tashdid in the Tale of Hitu were 
found in texts of the same genre of extended narratives which were also circu¬ 
lating in the Malay world by way of mouth, next to their written form preserved 
in the extant manuscripts. In the vast cultural realm of the Malay world it may 
be difficult to find a homogeneous body of texts that provide many indications 
of standardised traditions, but collections of unstandardised consistencies as 
accommodated by the Jawi script have certainly proven to be sufficient to com¬ 
municate through time and space and provide the ambiguity appreciated by 
interlocutors with different linguistic backgrounds. This shared cultural feature 
of different literary traditions from such a vast geographical space has certainly 
assisted in considering their ensemble as one ‘Malay’ tradition, however hetero¬ 
geneous this may be. 
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Dmitry Bondarev and Nikolay Dobronravin 

Standardisation Tendencies in Kanuri and 
Hausa Ajami Writings 

Abstract: The Kanuri and Hausa manuscript cultures had a standard model of 
the Arabic writing system whose uniformity is grounded in Arabic literacy in its 
Qur’anic and Classical forms. The scribes of both cultures (Kanuri starting from 
the mid-17 th century and Hausa from the early 19 th century) adhered to the prin¬ 
ciple of total orthographic uniformity in writing the Arabic texts but they used 
variable orthographic systems for writing in local languages (Ajami). Having been 
in contact for a long time, Kanuri and Hausa manuscript cultures share a similar 
type of Arabic script and belong to the Central Sudanic area of Ajami writing cha¬ 
racterised by specific graphemic choices for some sounds. 

Both Kanuri and Hausa orthographies developed from conservative simplified 
systems strictly modelled on Arabic letters to a more elaborated encoding of 
sounds. However, unlike Hausa writing of a later period, Kanuri tradition remai¬ 
ned largely unaffected by graphemic innovation. Both cultures have identifiable 
sets of grapheme-phoneme combinations which were stable within a restrictive 
range of the phonemic and graphemic inventories. From a diachronic perspective, 
Hausa writing shows a tendency towards a closer match between the number of 
phonemes and graphemes, whereas in Kanuri there is a tendency of the retention 
of the spelling of some high-frequency lexical and grammatical items. 


1 Introduction 

The expression ‘standardisation tendencies’ in the title of our chapter might 
trigger unintended connotations, both positive and negative. On the positive side, 
‘standard’ is usually equated with ‘stability’ and ‘norm’, and standardisation is 
customarily seen as a process of reaching an orderly state out of the chaos of 
variation. The development of a standard orthography is seen as progress, and 
as an explicit expression of widespread assumptions ‘about the empowerment 
and liberating effects of literacy’ (Blommaert 2004, 645). Various attempts by 
UNESCO to standardise orthographies for the world’s languages speak vividly for 
the assumed high value of a standard. 

On the other hand, the same title may be read negatively by scholars who 
challenge and problematise the ideologies of ‘graphocentrism’ which dominate 


3 Open Access. © 2019 Dmitry Bondarev and Nikolay Dobronravin, published by De Gruyter. 
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modern society whereby ‘a language is not seen as “complete” unless it has 
acquired a standard orthography’ (Blommaert 2004, 645). The expression ‘stan¬ 
dardisation tendencies’ may thus be understood as an attempt to highlight the 
‘evolutionary positive’ side of otherwise chaotic spelling, and to present writings 
in Kanuri and Hausa as systems aspiring to improvement by developing proper 
literacy. 

Discussing standardisation tendencies in this chapter, we neither defend 
nor deconstruct the assumed evolutionary unidirectionality of writing systems 
towards a standard form (albeit, as will become clear, our approach lies nowhere 
near any evolutionary model of orthography development). 

Our task is to document ranges of orthographic variation and uniformity and, 
where possible, identify their underlying causes in two writing cultures largely 
unaffected by European-driven standardisation policies. In doing so, we consi¬ 
der orthography as a system of graphemic combinations conditioned by various 
factors rather than as a fixed system of spelling rules. Thus, we avoid the more 
restrictive notion of orthography commonly defined as ‘the standardized variety 
of a given, language-specific writing system’ (Coulmas 2003, 35) or as ‘the set of 
conventions for writing words of the language’ (Sebba 2007,10-11). When dealing 
with diachronic and synchronic variation in a given orthographic system, we 
first outline a graphemic set for each phoneme of the language and then analyse 
which linguistic and extra-linguistic factors licence the selection of graphemes 
for such a set. For example, a phoneme /b/ can be written as <b, f, m, p>, but 
not as <s> or <k>. Or /g/ can be written as <g> or <k> but not as <b>, or /l/ as 
<d> or <1, r> but not as < g> or <b>, and so on. The sets of phoneme-grapheme 
correspondences may have different degrees of stability conditioned by linguis¬ 
tic domains, such as phonology, morphology, syntax, lexicon, prosody, genre, 
etc. For example, in various Kanuri Arabic-based orthographies, monosegmental 
and bisegmental consonantal sequences [nasal] - [plosive] (are typically written 
as single letters used for plosive consonants be it word-initial or word-medial 
(/ndara/ = <dara> ‘where’, /tfasundogi/ = <tasudugi> ‘they know’) but in a spe¬ 
cific morphological environment the nasal consonant may be written as a sepa¬ 
rate grapheme (/tfundogi/ = <tundugi> ‘he knows’). The nasal segment can also 
be explicitly written in the etymological spelling of words borrowed from Arabic 
(Bondarev 2014b, 128-132). 

Part of this approach involves tracing the development of such graphemic 
sets over time, and examining the components of an identified set as to whether 
they changed or were reduced to a one-to-one correspondence. Such chronologi¬ 
cal variations reflect possible phonological changes and/or cultural exchanges 
between different writing systems, be it in contact areas or under the hegemony 
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of a prestige orthographic system. Chronological graphemic variations may also 
account for the emergence of fixed restrictive conventions, or of a standard. 

From this point of view, we consider orthography not in terms of the ‘stan¬ 
dard versus nonstandard’ dichotomy, but rather as a combination of tendencies, 
not necessarily unidirectional. Thus, we can talk about patches of standard spel¬ 
ling within a system of internally organised sets conditioned by a multiplicity of 
linguistic and extra-linguistic factors. One set may be leaning towards a one-to- 
one representation, while the other may expand the scope of graphemes and thus 
become more variable. 

A good example of such changing orthographic practices in contact 
manuscript cultures involves the Arabic-based orthographies used in various 
historical periods for writing Kanuri and Hausa. This chapter is an attempt to 
compare the variation and consistency of general orthographic tendencies in 
Kanuri and Hausa manuscript cultures and to identify the graphemic sets parti¬ 
cularly prone to standardisation. 


2 Kanuri and Hausa: shared and different history 

The Kanuri and Hausa manuscript cultures have long been in contact with each 
other. Both were situated in the Sahelian region in what is now northern Nigeria 
and its neighbours to the north and east, and both grew out of early sub-Saharan 
Islamic polities. The Kanuri manuscript culture is rooted in the ancient Kanem- 
Borno whose ruling dynasty was one of the earliest in the Sahel to adopt Islam. 
The early Muslim elite society of Kanem can be traced back to the 11 th century, and 
from the late 12 th century onwards we find accounts of ‘considerable development 
of Islamic learning in Kanem’ (Hunwick 1995,16). The Arab written sources from 
the 12 th to the 16 th century as well as local 16 th century accounts about scholarly 
activity in Kanem-Borno can be taken as indirect evidence of the antiquity of the 
Kanuri manuscript tradition. However, the earliest manuscripts belonging to the 
Kanuri manuscript culture only go back to the 17 th century with one manuscript 
dated 1669 (Bivar 1960, Bondarev 2006,2014a). 

The legendary history of Islam in Hausaland starts with the arrival of foreign 
scholars such as al-Fazazi (d.626/1230) or al-Maghili (d.909/1503-4 or 910/1504-5) 
travelling through Borno or from ancient Mali (Lippert 1900; Palmer 1908; Star- 
ratt 1993). More Islamic books were reportedly brought to Kano by the Fulani 
from Mali in the reign of Yakubu, son of Abdulahi Burja (dated by Palmer as 
856-867/1452-1463 [Palmer 1908, 76-77]). Local written culture in Arabic, 
also linked with migrant scholars, already existed in Kano and Katsina in the 
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17 th century (for more details see Hunwick 1995). However, the earliest subs¬ 
tantial manuscript data related to Hausa goes back only to the first decades of 
the 19 th century (see e.g. Hiskett 1975, 18). Thus, the development of the Kanuri 
manuscript culture can be observed over a time span of about 350 years while 
Hausa manuscript culture is only observable over the last 200 years. This makes 
any comparative study of these two cultures slightly anachronistic. 

Such comparison is also complicated by historical interference created by 
the Western and Central Sudanic cultures between which the Hausa manuscript 
tradition developed. In the 15 th century, the Hausa cities Kano and Katsina came 
into the sphere of activity of the Wangara (Wangarawa) - the merchants and 
clerics who came from the west, from ancient Mali and Songhay polities. It may 
be argued that the Wangarawa in Hausaland belonged to the Jahkanke schol¬ 
arly lineages, ultimately stemming from the Soninke-speaking communities in 
what is now western Mali; however, their linguistic identities changed over time. 1 
According to one hypothesis, the Wangarawa who reached the Hausa city-states 
in the 15 th century spoke a Mande language (possibly Jula, see Al-Hajj 1968, Akin- 
wumi and Raji 1990); according to another, they spoke Songhay (Lovejoy 1978). 
Later, with the expansion of Borno in the 16 th century, the Hausa became incre¬ 
asingly exposed to Kanuri culture, including Kanuri administrative systems and 
their manuscript tradition, especially discernible in the script style. Sometime in 
the 18 th century, Hausa states again came under western influence, this time from 
the Toronkawa Fulani Muslim scholars among whose circles reformist ideas deve¬ 
loped into the jihad movements, ultimately resulting in the creation of the Sokoto 
Caliphate in the early 19 th century (Hiskett 1957, Levtzion 2000, 83-86). 

Given these connections with various scholarly Muslim communities, it is 
safe to suggest that the Hausa manuscript tradition developed in contact with 
cultures both to the west (Wangarawa, Fulani) and to the east (Kanuri), probably 
with varying influence from each side. For example, the most common Hausa 
script style (defined as ‘KanawT or ‘Hausawi’) derives from the Borno calligraphic 
tradition (Brigaglia and Nobili 2013), whereas some Sokoto scribes were appa¬ 
rently more familiar with the Western Sudanic and Saharan styles (for a tenta¬ 
tive classification see Nobili 2011). The techniques of glossing the Arabic texts in 
Hausa, e.g. the marking of glosses as ‘jm (Ajami), were possibly influenced by the 
Western Sudanic tradition, but this remains an open question. 


1 According to Lamin Sanneh, the family name (jamu) of ‘Abd al-Rahman Zaiti or Zagaiti, the 
head of the Wangarawa who came to Kano, should be read as Jakhite. Sanneh saw the Wangara¬ 
wa as a Manding-speaking community of the Jakhanke (Sanneh 1989, 32-35). 
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Interestingly, most of the surviving manuscripts recognised as originating in 
the Kanuri and Hausa cultures coincide with the earliest attestations of writing 
in these languages. Therefore, the material side of Kanuri and Hausa manuscript 
cultures is validated both on linguistic grounds and by the evidence of codicolo- 
gical and palaeographic features. Thus (if we disregard the question of the appli¬ 
cation of a literary language to wider cultural and regional domains), when we 
see Hausa or Kanuri in manuscripts, it is typically the most solid evidence for the 
existence of these manuscript cultures, whereas script style is much less indica¬ 
tive as a feature of attribution, and layout even less. This is still more relevant 
for Hausa, because extant Hausa manuscripts come from a vast array of cultural 
and sociolinguistic contexts ranging from the 19 th -century Caribbean and Brazil 
to West and North Africa. 


3 Kanuri literacy in Arabic script 

There are several sociolinguistic dimensions to the study of Kanuri texts in Arabic 
script, or Kanuri Ajami. The earliest written evidence of literacy in the area nor¬ 
thwest of Lake Chad comes from annotated Qur’an manuscripts dating to the 17 th 
century and produced in the Borno Sultanate, what is today northeast Nigeria and 
southeast Niger. The annotations were written in a distinct linguistic variety called 
Old Kanembu or Tarjumo, closely related to modern-day Kanuri (Bondarev 2013a, 
b, 2014a, b). Kanembu, as part of the term Old Kanembu, refers to a prominent 
dialect cluster of Kanuri spoken around northern, northeastern and eastern areas of 
Lake Chad. This was the region of the Kanem Sultanate, or Kanem - an influential 
Islamic polity established around the 12 th century. Kanem is considered to be a poli¬ 
tical and religious predecessor to the Borno Sultanate and the annotated Qur’anic 
manuscripts are the earliest surviving witnesses to literacy practices in the domain 
of Islamic education spread between the region of Kanem and Borno. The linguistic 
variety used in the annotations of the Borno Qur’ans took on the wider role of a 
specialised language used exclusively for the purpose of translating Arabic texts 
at various intermediate and higher stages of Islamic education. This tradition still 
survives in northeast Nigeria, albeit diminishing under the pressure of a growing 
influence of non-Kanuri speakers and of reformist-oriented religious leaders. 

There is some inconclusive evidence that Old Kanembu had developed into 
a classical language used for composition and unrelated to the translation func¬ 
tion. Thus, a five-line verse poem (visually resembling a classical takhmls style) 
in a variety of Kanembu found by Bondarev in Mao, Chad, in 2011 (Fig. 1) shows 
features which are typical of Old Kanembu in the Borno Qur’ans. However, there 
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Fig. 1: Poem in Kanembu on religious and secular duties. Mao, Chad, Mala Abbakar Abdala‘s 
collection. Courtesy of SOAS University of London, SOAS Digital Collections, Borno and Old 
Kanembu Islamic Manuscripts, qasida in Kanembu in wu'az genre. 
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are many features in the language of the poem which set this variety apart from 
both Old Kanembu and Kanembu dialects, giving the impression of a pan-dialect 
idiom that could only have been used for literary compositions. 

Another dimension of literacy in Kanuri Ajami has to do with writing in the 
western (non-Kanembu) varieties of Kanuri dialects. The earliest evidence for 
writing in the western dialects of Kanuri dates to the beginning of the 20 th century 
and it comes from historical records, mostly dealing with the lists of the Kanem 
and Borno rulers and a summary of their deeds. This genealogical and historical 
genre known in Kanuri as gargam/girgam might have existed in written form as 
early as the 1790s (Bondarev 2014a, 120-121). All known copies of gargams ori¬ 
ginate from the area of Borno, situated west of Lake Chad, and account for the 
use of (western) Kanuri dialects. To date, no written gargams from the east of the 
historical Kanem-Borno (in what is now Kanem province in the Republic of Chad) 
have come to light, and it is impossible to say whether the same gargam genre was 
written in a variety of Kanembu. 

A distinct corpus of writing in Kanuri consists of Ajami manuscripts commis¬ 
sioned by pre-colonial European explorers and colonial administrators. In terms 
of their relevance to the study of spelling conventions in Kanuri and Kanembu 
manuscripts, the most interesting are those Ajami texts written by those Kanuri 
speakers schooled in the traditional system of literacy based on Arabic and Islamic 
learning. This type of Ajami is represented in the short folktales published in 1911 
by Philip Askell Benton, a colonial officer (formally called Assistant Resident) 
who collected the texts some time at the beginning of the 20 th century (Benton 
1911; treated in Bondarev 2014b, 132,139-140). Ajami writings commissioned by 
Europeans and transcribed by scribes who did not belong to the Kanuri-speaking 
cultural area are of lesser relevance to the present study because these scribes 
had their own spelling conventions which were idiosyncratic or influenced by 
other cultural areas. The earliest such examples are Kanuri texts and word lists 
collected by James Richardson, an English traveller, during his expeditions to 
sub-Saharan Africa between 1848 and 1851 (Bondarev 2007, 67-68). Specimens 
of Kanuri Ajami written outside the Kanuri and Kanembu speaking cultural areas 
were also published by the German scholar Rudolf Prietze (1914,1930). 

In more recent times, Kanuri writing in Arabic script has remained visible in 
two distinct genres: translations of Arabic texts, and religious didactic poems. 
Translations are written in a variety of Old Kanembu, a continuation of the practi¬ 
ces found in the commentaries of the ancient Borno Qur’anic manuscripts. The 
Islamic didactic poems are written in a literary variety of Kanuri, possibly similar 
to the registers used in the gargams and folktales. The texts of both genres are 
circulated in printed market editions - cheap facsimile reproductions of hand¬ 
written originals. 
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3.1 Kanuri orthographic conventions 

Tendencies in the orthographic conventions of the Old Kanembu Ajami writing 
used in the Borno Qur’ans have been described in Bondarev 2006, 2007, 2014b. 
The present section gives a short summary of previous findings in the study of the 
orthography of Old Kanembu: consonantal graphemes and a quite detailed ana¬ 
lysis of the development of vowel encoding, followed by a presentation of general 
graphemic features that withstood the test of time and regional differences. 

The scribes who wrote in earlier varieties of Kanuri and Kanembu were con¬ 
servative users of the Arabic script adapted for the languages of West Africa. 
Unlike many other Ajami graphic systems which gradually developed special 
signs for their non-Arabic phonology (see a comprehensive list in Mumin and 
Versteegh 2014, 1-22), Ajami writing in Kanuri and Kanembu remained graphe- 
mically minimalistic even in the more recent manuscripts and facsimile editions. 
Thus, there have been no graphemic innovations throughout the whole history of 
Kanuri and Kanembu Ajami writing outlined in the previous section. 2 


3.2 Consonants in Kanuri Ajami 

One possible reason for graphemic conservatism is veneration of the Arabic script 
as the written emanation of God’s word - a factor difficult to prove for earlier 
manuscripts (due to the absence of written testimonies left by the Borno scribes) 
but impossible to rule out in a Muslim society. The question of veneration aside, 
the lack of graphemic innovations might plausibly be accounted for by the pho¬ 
nological system of Kanuri, especially in the domain of consonants. In the diffe¬ 
rent analyses, and depending on the Kanuri dialect, the number of consonants 
ranges from 19 (Cyffer 1998,19) to 21 (Hutchison 1981,17) to 25 (Lukas 1937), and 
24 in Mowar (Bulakarima 2001, 42). In Kanembu dialects, Jouannet (1982) pos¬ 
tulates from 20 to 26 consonants for Ngaldoukou Kanembu and Lukas (1931) 26 


2 There may be slight evidence to the contrary. In 1885, Jean Marie Le Roux met a Kanuri-spea- 
ker (from Kukawa) in Algeria who wrote several pages at his request. According to Le Roux, ‘He 
wrote from top to bottom. His writing bears a lot of resemblance to Arabic writing; but it should 
be noted that it employs a greater number of letters than does the Arabic alphabet’. (Le Roux, 
1886, x, fnl) (Ce negre ecrivit, en ma presence, quelques pages que j’ai conserves. II ecrivait 
de haut en bas. Son ecriture a beaucoup de ressemblance avec l’ecriture arabe ; mais il est a 
remarquer qu’il emploie un plus grand nombre de lettres que ne comporte l’alphabet arabe.) It 
is however possible that the French author was just confused by the peculiarities of the Central 
Sudanic writing style. 
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consonants for Kaidi Kanembu. For Old Kanembu - the earliest written variety of 
Kanuri and Kanembu - 23 consonants have been reconstructed (Bondarev 2014b). 
Compared to these numbers, Arabic has 28 consonants represented in writing by 
28 graphemes. Of the lesser numbers of Kanuri / Kanembu consonants, only two 
are potentially problematic for choosing an appropriate Arabic letter. One is a 
bilabial plosive /p/ (occurring only in syllable final position) and the other is a 
voiceless alveolar affricate/tf/ (or in earlier text /ts/). The consonant /b/ is written 
either as <b> or <f> and the consonant /tf/ as <t>. 

Other choices of Arabic consonantal letters are based on almost straight¬ 
forward correspondences between the remaining Kanuri consonants and hom- 
organically similar Arabic consonants. The fact that Kanuri has four prenasa¬ 
lised stops mb, nd, nd 3 (*ndz in Old Kanembu) and ijg (analysed differently as 
monosegmental or bisegmental) which are absent in Arabic does not complicate 
orthographic choices, because these consonants can be written with the existing 
Arabic letters either as monographemic non-sonorant consonants or as digraphs. 
Monographemic encoding of prenasalised stops is the most typical convention so 
that mb, for example, is written as <b>. Digraphic orthography is usually applied 
for nasal-plosive sequences when they occur at syllable boundaries (Bondarev 
2014b, 128-32). Such sequences are perfectly normal in Arabic and writing them 
in Old Kanembu or Kanuri does not require any graphemic innovation. Thus, the 
sequence /n-d/ is written as <nd> both in Old Kanembu/Kanuri and Arabic, for 
example, andV we’ and inda ‘at’ respectively. 

Given that the choices of appropriate consonant letters were not complicated 
by any implosive or globalised consonants (nonexistent in Kanuri, unlike Hausa) 
and given that the number of Arabic graphemes is higher than the number of 
consonants in Kanuri or Kanembu, there was no apparent need for Kanuri and 
Kanembu scribes to invent new graphemes. 

The consonant letters which stand for the Arabic phonemes absent in Old 
Kanembu were not redundant, rather they were utilised in two different ways. 
One was the retention of Arabic spelling for Arabic loanwords, and the other was 
free spelling variation. These nine letters are sad, dad, ta’, za’, ' ain, qaf, xa \ da’, 
and sin. 

Retention of Arabic spelling in Old Kanembu writing is illustrated here by the 
letter dad. One of the most frequent words in Old Kanembu is <lard> /lards/ from 
Arabic al-’ard ‘earth’. This is typically written above or in close proximity to the 
Arabic source word. Another example is the verb <daro> ‘to harm’ from Arabic 
darra. Similar to <lard>, <daro> occurs in visual proximity to the Arabic darra. 
However, it is also often written above the Arabic zalama ‘to do wrong; to harm’. 
This means that the spelling of Old Kanembu <daro> was not simply dependent 
on visual correspondence with the Arabic graphemic source word but rather was 
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an abstract orthographic rule which could be reconstructed as prescribing that 
‘Arabic loanwords be written with the relevant Arabic consonants’ (an etymo¬ 
logical spelling comparable to the orthography of the English loanwords cliche, 
haute, kitsch, or oeuvre). 

These same nine letters are sporadically used in free variation with the letters 
encoding homorganic phonemes or phonemes which have a similar manner of 
articulation. For example, in Old Kanembu writing, sin ~ta’~ sin are used for /s/, 
za’ ~ dal ~ zal for /z/ and qaf~ /cd/for /k/. 

In the earlier Old Kanembu manuscripts these spelling variants are much less 
common than in the later ones. The Borno Qur’an manuscript MS.1YM (most pro¬ 
bably written before the end of the 18 th century) is a good example, showing more 
frequent variation than the Borno Qur’ans of the 17 th -century. 

One of these nine letters, sin, becomes more stable in the manuscripts of the 
19 th and 20 th centuries, in the sense that it is not used as a variant of sin to encode 
the phoneme /s/ but is only used to represent /§/. The emergence of sin as a letter 
in its own right was most probably conditioned by emergence of palatal conso¬ 
nants in Kanuri (e.g. *sin > sin ‘eye’, see Bondarev 2014b). 

Approximately around the same post-18 th -century period, the letter da’ - 
hitherto a rare variant of zal for /z/ — became much more prominent in the Old 
Kanembu of the Qur’an manuscripts, in the Kanuri of the kings’ lists [gargam ), 
in Kanuri writings commissioned by colonial officers and scholars, and in the 
Tarjumo and Kanuri didactic poems of the late 19 th century. It is still unclear what 
conditioned this increase in frequency. 


3.3 Vowels in Kanuri Ajami 

As is typical for other sub-Saharan Ajami traditions, the vowels in most Kanuri and 
(Old) Kanembu texts are written with the vocalic diacritics borrowed from Arabic 
vocalised writing. Some rare examples of writing vowels in plene - by means of the 
three Arabic letters used for long vowels - have been found in an Old Kanembu 
manuscript in Leeds University Library (MS 357, Bondarev 2007, 69). At present, we 
do not know whether this spelling convention existed alongside the more typical 
diacritic-based spelling of vowels or whether it was a one-off innovation. 

The number of vowel phonemes in Kanuri and Kanembu ranges from 6 to 11 
depending on dialect and/or analysis: 6 vowels in Mowar Kanuri (Bulakarima 
2001); 7 vowels in Yerwa Kanuri (Hutchison 1981, Cyffer 1998); 11 in Ngaldou- 
kou Kanembu (Jouannet 1982). The number of vowels in Old Kanembu cannot be 
established conclusively, but there were at least six (i, e, a, a, o, u). Other possible 
numbers of vowels may have been seven, nine or eleven. Given that the six pho- 
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nemes are clearly distinguishable in Kanuri and (most of the) Kanembu Ajami 
texts, we will only be dealing here with orthographic conventions used in writing 
these vowels. Of the six, the most ambiguous are /o/ and /u/ since they are under- 
specified, being represented by the same diacritic sign damma, used in Arabic for 
the short vowel /u/. However, even in the earliest extant manuscripts, the /o/ in 
word final position was often differentiated from /u/. 

The most typical graphemic solution for encoding /o/ in final position was 
a combination of the damma with the letter waw followed by the letter alif sur¬ 
mounted by a sulcun sign. The earliest spelling of this type occurs in the Qur’an 
manuscript MS.3ImI in the annotations written before 1669. For example, in 
kasiko ‘it will be’ (fol. 13r; Q.2:137). wm 

Another manuscript of a similar age (written before 1689) has the same 
orthography for final /o/. One can see a sukun above alif in tatarto ‘he will enter’ 
(Q.84:12; MS.Arabe 402, fol. 250r). VP” 

The same spelling is found in the Qur’anic manuscript MS.2ShI< (e.g. Q.2:19, 
and is very consistent throughout the manuscript) and in three other early 
manuscripts such as MS.4MM, MS.Konduga, and MS.Kaduna.AR.33-1. The histo¬ 
rical gargam texts written in Kanuri show the same damma-waw-alif-sukun enco¬ 
ding of final /o/ (MS.H 279-282, Bondarev 2014a). 

Nevertheless, variations in spelling do occur in most of the manuscripts. One 
common spelling is marking /o/ with the same combination of letters and diac¬ 
ritics but without a sukun above alif as in MS.Arabe 402 sasiko ‘they will be (in 
hell)’ Q.82:14, fol. 249r. 

A less frequent variant of the previous spelling of final /o/ is using only 
damma and waw. Although potentially ambiguous with /u/, it is very consistent 
in MS.Kano.Tahir. 

The combination ‘sukun above alif is undoubtedly modelled on an orthogra¬ 
phic rule in the Qur’an manuscripts, namely, marking the alif as a silent letter in 
the ending of the perfect forms of the verbs in the 3 rd person plural, e.g. qalu ‘they 
said’. The frequency of this form in the Qur’an is very high and many Old Kanembu 
glosses are written next to such verb forms, creating an orthographic harmony 
between the spelling of the Qur’an and that of the commentary in Old Kanembu. 
For example, in the dated Qur’an MS.3ImI, the Qur’anic phrase Q.2:71 wa ma kadu 
yafaliin ‘they almost did not do it’ or ‘they were on the point of not doing it’, where 
the underlined verb kadu is written with the final alif surmounted by a sukun, is 
translated into Old Kanembu bukiya kisadiro tadikibu, with the final vowel in the 
last word written exactly as in the Qur’anic kadu (fol. 8r; Q.2:71). 3 1 


3 It is unclear whether ltisadiro is written with damma above ra ’ or with a sulcun. 
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This particular case can be ambiguous because the negative suffix bo might have 
been pronounced \hu:] as in Tarjumo, but other instances of such spelling clearly 
correspond to final /o/ as in the examples above. 

A peculiar type of marking /o/ in final position is found in MS.Konduga, 
where the letter ha’ is vocalised by a damma (e.g. fol. 18r; Q.2:199) <walsikin- 
ruhu> corresponding to what can be reconstructed as *walsikinrd ‘from where 
they emerged’, the final -ro possibly being an adverbial marker). 

Some lexical items with final /o/ (such as in ago ‘thing’) were written with 
reduced notation (using only damma + waw ) and are found as early as the 17 th 
century in MS.3ImI. The spelling of this word has been consistent throughout the 
history of Ajami in Old Kanembu and Kanuri, starting from the Borno Qur’ans of 
the 17 th and 18 th centuries to the early 20 th century (as seen in Benton 1911, VII). 
However, some items with final /o/, for example the indirect marker -ro, were 
written in an underspecified manner, that is by using only damma - the sign for 
either /o/ or /u/. What is remarkable is that, in all known manuscripts, -ro was 
never written differently, which shows that the underspecified spelling of -ro was 
highly conventionalised across time and manuscripts. 

In general, the final vowels in graphic words (not necessarily matching the 
lexical units) had the highest tendency for standardisation, on condition that 
these words occur frequently. One such item is the 3 rd person singular suffix -i in 
perfective verb forms which tends to be written with the letter yd’ [gulji ‘he said’). 
Another item is the adverb afi (reconstructed as a hi, modern form awi) ‘what’ 
having already been standardised in the 17 th -century manuscripts (j»i Q.82:18, 
Arabe 402, fol. 249r). 

It is interesting that, starting from the earliest manuscripts, most of the 
orthographic conventions were already in place. Indeed, the spelling principles 
of many frequent items did not change over time, including ago ‘thing’, afi ‘what’ 
/ ‘which’, siki ‘there is’ (e.g., MS.3ImI Q.3:78; MS.2ShI< Q.2:157), demonstratives ti 
‘that’ and ni ‘those’, independent pronouns hu ‘I’, ni ‘you’, ti ‘he/she’, handi ‘we’, 
nadi ‘you (pi.)’ and tandi ‘they’. 

One significant exception to the spelling of the final /i/ in the pronouns is 
MS.3ImI, where this segment is written in four different ways. The first (and infre¬ 
quent) spelling uses a single vocalic sign kasra (e.g. tandi ‘they’) and the three 
other spelling combinations are based on the letter ha’; these ha’-based spellings 
are as follows: (1) kasra followed by the letters ya’ and ha’ (tandih ); (2) kasra fol¬ 
lowed by yd’ and the letter ha’ vocalised by kasra (nandihi); (3) kasra followed by 
ha’ and kasra (tandihi ‘they’, nandihi ‘you pi.’). 
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1 . 

2 . 

3. 




tandih (Q.3:20-21) 
nandihi (Q.3:28) 


nandihi (Q.2:233) 11 


tandihi (Q.3:91) 


The word final ha’ deserves special attention because of its several usages in Old 
Kanembu and Kanuri writings. Beside the above examples where the use of ha’ 
could have been motivated by phonological features such as high tone or length 
of the vowel, the same (earliest) manuscript MS.3ImI has the letter ha’ for demar¬ 
cating the terminus of the graphic (a nd possi bly also prosodic) unit isnuyi-ka ‘to 
the deceased’ written as <isnuyikah>: isnuyika (deceased.DO) ‘(Allah will 

bring life) to the deceased ones’ Q2:73. 

This ‘otiose’ use of the letter ha’ is less common in some manuscripts and 
more common in others. Some pages of MS.2ShK give the impression that there 
was a strong tendency for ha’ to become a standard convention for marking the 
final demarcation of lexical and phrasal units. The idea that something was going 
on in the scribes’ minds about the usage of this letter in final position can be 
inferred from the variant spellings of the final graphemic segments. Firstly, ha’ 
is most frequently marked with a sukiin (a zero-vowel sign), but sometimes it is 
left unmarked as in later manuscripts. Secondly, the items which do not have 
ha’ at the end are all function words, grammatical markers and pronouns, such 
as the direct object clitic ka, indirect/adverbialiser clitic ro, ablative postposition 
lean, definitive determiners ti <ti> ‘this’ and ani <ani> ‘those’. The only gramma¬ 
tical items written with the final ha’ (rarely with a sukiin above) are the subject 
marker -yi <yih> and the genitive -bi <bih>. Thus, the tendency in the manuscript 
MS.2ShK is as follows: the final ha’ is used in content words. Exceptions are the 
content words with final /o/ which are written in “full” orthography ( damma- 
waw-alif-sukun).' 1 

Two grammatical elements - the subject marker yi and genitive marker bi - 
are written with ha’ at the end. No other function word ends with the letter ha’. 
One representative example of this spelling convention in MS.2ShK is the recto of 
folio 13 corresponding to Q.2:120-125. 


4 The use of final ha’ with or without sukun may be influenced by the various ways of writing 
wordfinal /-a/ in Arabic. The same explanation may apply to such uses of ha’ in Hausa ma¬ 
nuscripts from the 19 th century up to the present (see below). 
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This (leaning towards an) orthographic rule of writing the letter ha’ in 
MS.2ShI< does not feature in other manuscripts. For example, in the Borno Qur’an 
manuscript from Konduga (MS.Konduga), ha’ is used for tagging the end of a 
content word, except for the content words with final back vowels u and o. But 
ha’ is also used to signal the end of most of the grammatical and functional items, 
including the subject marker -yi <yih> and the genitive -hi <bih>, but excluding 
the partitive clitic kami ‘from within’. 

In the genealogical gargam manuscripts of the late 19 th century, ha’ is not 
used at all, with the exception of MS.H 282, where the diphthong ai in the word 
mai ‘ruler’ is written with a final hd ’<mayh> (the same encoding of the diphthong 
in the word mai is also attested in the earlier Qur’an manuscript MS.1YM). 

In more recent manuscripts of the qasida genre there seems to be a tendency 
to write the final ha’ when there is phonetic (non-phonological) aspiration after 
a and o at the end of each hemistich. This is shown in the following line of a 
didactic poem on the fundamentals of Islam. 

<Allah meji din-ro no'atah gabtuma tamun numbudnah> 

Alla meji dinro noata, ngabtdma samun mmbuzana 

‘The eternal existence of God is known, (it is) like (something) remaining endlessly’ 

(al-Barnawi 1997, 2) 

The hemistich final e, a and u that have prosodic lengthening (i.e. phonetically 
long vowels) motivated by poetic intonation, and also final i, are typically not 
marked by the letter ha’. 

Writings in Kanuri, Kanembu and Old Kanembu belong to what we call the 
‘imala type’ of Ajami; this is manifested in a specific representation of /e/. Imala 
is a dot below the letter indicating a fronted /a/ pronounced closer to [e] in the 
Qur’anic Arabic of the Warsh variety, i.e. the variety used in the Qur’anic reading 
tradition transmitted by ‘Utman ibn SaTd al-Qutbl. 

In the Old Kanembu of MS.2ShK, imala indicates a vowel /e/ (e.g. Q.2:137). 
The combination imala + alif in the form of ya’ {alif maqsura) + a short alif above 
may have been used for either prosodic long [e] or a high tone /e/ (Q.2:140). Given 
the temporal proximity of this manuscript to the dated (1669) MS.3ImI (Bivar 
1960, Bondarev 2014a), this is probably the earliest attestation of the use of imala 
for this purpose. However, in many other cases what corresponds to /e/ in the 
later manuscripts (e.g. MS.1YM) or in modern Kanuri, Kanembu and Tarjumo is 
written using kasra, which is the default grapheme for /i/. For example, the agen- 
tive marker ye and genitive be are always written as <yi> and <bi>. 
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In contrast, the scribe of the dated MS.3ImI writes kasra for what corresponds to 
imala in MS.2ShI< or to /e/ in all known modern varieties. Nor does the 17 th century 
manuscript (MS.Arabe 402) have imala. This is interesting because fronted /a/ in 
the Qur’anic Arabic transmitted in the Warsh version is written with imala and it 
is unclear why the scribes of these two early manuscripts did not copy the diacritic 
sign for a similar sound in Old Kanembu (as many other scribes did). It would be 
tempting to assume that, historically, there was no /e/ in Old Kanembu and thus it 
was absent in the language that served as the source for this learned variety. This 
is however hardly plausible. Firstly, all known Saharan languages to which Old 
Kanembu belongs have /e/. Secondly, the scribes of the manuscripts contemporary 
with these two do use imala for the same lexical and grammatical items. 

Out of the six vowels that can be reconstructed for Old Kanembu, the mid-cen¬ 
tral schwa /a/ has the most unambiguous orthography. It is typically written with 
sukiin (zero vowel). This is found in all later manuscripts and types of Kanuri Ajami. 


3.4 General tendencies in Kanuri orthography 

As can be seen in the examples described above, orthographic conventions tend 
to stabilise in one single manuscript and differ from manuscript to manuscript 
(see also Bondarev 2014a, 145-6, 2014b, 113). One possible factor for the inter¬ 
nal regulation (or levelling) of a manuscript’s orthography is the shared space in 
terms of both writing support and the physical location of the manuscript within 
the same group of scribes. As suggested previously: 

In the past, the manuscripts served a surrogate role of public platforms in the competition 
of the auditory-vocal and written forms, i.e. when paper was more expensive the written 
form was visually shared by more people - teachers and students - and so conventions in 
spelling were due to a collective writing space. (Bondarev and Tijani 2013,133) 

However, there are a number of orthographic conventions which remain the same 
across manuscripts, irrespective of the time and place of their production. For 
Kanuri, six such stable tendencies may be identified. 

1. Underrepresentation of the nasal segment in prenasalised stops, such as mb, nd, 
nd$ (*ndz in Old Kanembu) and pg written as <b>, <d>, <j> and <g> respectively. 

2. Retention of Arabic spelling for Arabic loanwords. 

3. The letter kaf used for k and the letter ghain for g (as in Hausa Ajami dis¬ 
cussed in Section 4 below). 

4. Writing schwa a with a sukiin sign. 
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5. Tendency to distinguish between o and u at the end of graphic words or 
graphic phrases (not necessarily matching lexical or phrasal units). 

6. Orthographic uniformity of higher frequency lexical and grammatical items. 

Some of the conventions listed above are more widespread and found in several 
writing traditions (for example, underrepresentation of the nasal segment in 
prenasalised stops), while others are more region-specific and only shared within 
a literacy contact area (like the letter kaf for k and ghain for g, or the Arabic “weak” 
letters used for suprasegmental features such as the combination of stress and 
tone); yet other conventions are language-specific (like schwa written with the 
sukun in Kanuri and Old Kanembu). 

Some high frequency items in Old Kanembu could also have been encoded in a 
more standardised way due to their prosodic prominence at the end of an intonational 
phrase, as is evidenced in modem day practices of commenting the Qur’an in Tarjumo. 
Many phrase-final and sentence-final items (e.g. [tjigi:] ‘there is’, [gen:] ‘in’) have such 
a prominent pitch and length in Tarjumo recitations that the whole practice of com¬ 
menting the Qur’an is sometimes referred to by laymen as ‘cigi and gen recitation’. 

One factor accounting for the orthographic uniformity of many lexical and 
grammatical items in the Kanuri Ajami writings of the later period (e.g. the 
gargams and texts commissioned by Europeans) might have been the scholarly 
background shared by the scribes: 

the gargams [the late nineteenth century] are written in a careful, sometimes calligra¬ 
phic hand comparable to [...] the Borno Qur’anic manuscripts [...] The [...] conventions in 
Benton’s texts [1910s] [...] together with the confident calligraphic hands [...] betray their 
authors’ affiliation with the ‘ulama’ circles and suggest that they were used to write com¬ 
mentaries in a more codified Old Kanembu (Bondarev 2014b, 139-40). 


4 Hausa literacy in Arabic script 

The development of Hausa literacy in Arabic script, or the Ajami tradition, was, 
until the 20 th century, both successful and marginalised. The marginal character 
of Hausa Ajami was primarily motivated by the position of Hausa as a secondary 
written language (after Arabic). Marginalisation of written Hausa was both con¬ 
ceptual and literal, as we find many Hausa glosses on the margins of texts written 
in Arabic. Such glosses are rarely taken into consideration in the descriptions of 
the manuscripts. 

It is thus not surprising that many scholars, starting with such authorities as 
Mervyn Hiskett, largely ignored the Hausa glosses when describing West African 
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Ajami writings. This is in strong contrast to the approach dominating Irish or Sla¬ 
vonic studies. In the latter, the early cases of ‘native’ marginal usage have been 
extensively studied and generally interpreted as the beginning of the respective 
local literacies. 

Very little is known about Hausa or related Chadic languages before the 18 th 
century, so one has to rely on linguistic reconstructions. In the 16 th century, Leo 
Africanus stressed the role of the so-called Gobir language in the region between 
the Niger and Lake Chad, that is, between the Songhay Empire and Borno. Gobir 
was one of the Hausa states, and it is generally assumed, with some degree of 
certainty, that the Gobir language was an early form of Hausa. Later on, the Hausa 
people and language were mentioned as Afnu, apparently a Kanuri word (Afuno) . 5 

Before the early 19 th century, Hausa as a written language was in no compa¬ 
rable to Soninke, Fula or Berber (in any script). In Western Sudanic Africa, the 
spread of Ajami in scholarly varieties of Soninke, Fula and Wolof might have 
been influenced by the peculiarities of historical development, especially in 
the coastal regions where Islamisation encountered an ever growing European 
presence (with written contracts, treaties, etc.). 6 In Central Sudanic Africa, Old 
Kanembu (Tarjumo) was commonly used in Qur’anic glosses at that time. The 
written rendering of words and sentences in Old Kanembu was already a norm 
rather than an exception. In all such cases, a certain tendency towards standar¬ 
disation was attested. 

Not the slightest evidence of written Hausa can be found for that period. There 
are a few words (personal and place-names) in Arabic texts, but this in no way com¬ 
pares to the written tradition of the Middle Niger region. Later, there are a few refe¬ 
rences to the Hausa literature produced in Katsina as early as the 17 th century by Wali 
dan Masani (‘Umar b. Muhammad b. ‘Abd Allah b. Nuh al-Barnawi al-Kashnawi, or 
Dan Masani(h), b. c. 1003/1594-5, d. 2 Rajab 1078/18 December 1667). 7 Hausa sources 
mention only one by Wali dan Masani by Yahaya (1988,38) and this particular work 
is interesting, in as much as it was recited from memory by a member of Wali dan 
Masani’s extended family three centuries after his death, and not from a manuscript. 
This means that the Katsina scholar could indeed have authored the text, but there 
is no proof as of yet that the work was ever written down. 


5 Tomasz Habraszewski (1967, 63), in his study of a 17 lh -century vocabulary of Kanuri collected 
by Evliya Celebi, a Turkish traveler, suggested that one word, gurasa ‘bread’ written as kurasa, 
might be Hausa: ‘This is a Hausa word, not known to a Kanuri dictionary’. The word in question 
was a borrowing from Arabic, and is also used in the Sudanese and Chadian/Shuwa Arabic dia¬ 
lects with the same meaning, so it was not necessarily Hausa. 

6 Tal Tamari, personal communication. Also see, e.g., Brooks and Mouser 1987 

7 On Dan Masani and his works see fohn 0. Hunwick et al. 1995, 29-20. 
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It is quite possible that this poem and some other Hausa texts had been 
transmitted orally since the 17 th centuries, but the dates for written Hausa (Dan 
Masani’s lifetime, born c.1003/1594-5, died in 1078/1667 according to Hunwick 
et al. 1995, 29) suggested by Yahaya (1988, 31-42) have proved incorrect. The 
manuscripts referred to by Yahaya are much more recent, although we cannot 
exclude the possibility that some texts might have been copied from earlier works. 

Yahaya’s view of Hausa as a written language before the early 19 th -century 
jihad and the creation of the Sokoto Caliphate is similar to that of Gottlob Adolf 
Krause, a German scholar and pioneer of Ajami studies in the 19 th century. 8 

The first dated example of Hausa Ajami (in a multilingual text) was written 
in the Caribbean diaspora in 1817, although there are a few non-dated poems in 
Hausa which have been cited as examples of writing before the 19 th century. The 
list of such poems is extremely short, and we do not know if any of these had 
been transmitted in written form prior to the foundation of the Sokoto Caliphate. 
In one such poem, known as ‘Billahi arumu’ and attributed to Muhammadu na 
Birnin Gwari, Hausa is referred to as Baubauci, literally, ‘pagan language’. The 
unknown author also tried to explain why he dared to use this language instead 
of Arabic, and this type of explanation - even ‘defense’ - is often found in non- 
Arabic writings, both in Sudanic Africa and elsewhere in the Islamic world. Such 
a work, if dated, would mark an important step in the growth of Hausa Ajami, 
from simple glosses to an authored literary production. However, there is a sig¬ 
nificant discrepancy in the dates given for Muhammadu na Birnin Gwari’s life. 
According to Hunwick et al. (1995, 233) he flourished in 1850. Yahaya refers to a 
much earlier date, giving 1178/1758 as the date of his birth. 9 

Turning to the standards of Hausa as a written language since the early 19 th 
century, we can deduce a few general rules. First, until the end of the pre-colonial 
period there was no uniform tradition of Hausa literacy. Geographically speaking 
(as mentioned in Section 2), there were two poles of attraction. One variety of 
Hausa Ajami may be described as Western or Sokoto-centred and close to the 
written traditions of the Central Niger region. Another variety (mainly in Kano) 
looks like an offshoot of the venerable centuries-old tradition of Borno (Kanuri) 
Ajami. Both were reflected not just in their specific conventions of transcription, 
but also in their respective styles of handwriting. Even today, when buying an 
Ajami book in Northern Nigeria, one can easily refer to the style used in Kano as 
opposed to that of the printed Middle Eastern publications in Arabic and, interes¬ 
tingly, that of Zaria Ajami (Sokoto is now only a minor centre of Ajami publishing). 


8 Krause 1884, 29, see footnote 10 below. 

9 Yahaya 1988, 45. 
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The east-west division in Hausa Ajami standards is not surprising, given 
the role of “Western” Songhay-speaking and “Eastern” Kanuri-speaking Islamic 
scholars and schools in the development of learning in the Hausa states. Looking 
to the Western tradition, Gottlob Adolf Krause wrote: 

Prior to the Ful (Fulbe) the Hausa possessed a script, also Arabic - with some slight vari¬ 
ations, and, if my inquiries are confirmed, the Songhai had written their native language 
before the Hausa. 10 

This hypothesis has only partially been confirmed, as there are many Songhay 
words in the historical chronicles written in Arabic, such as Ta’rikh al-Sudan. The 
practice of advanced studies in the “East”, that is, in Borno and adjacent Hausa 
cities such as Hadejia, continued well into the 20 th century. At the same time, 
scholars coming from the “French” territory (mainly from Niger and often spea¬ 
king the Songhay Zarma languages) continued to resettle in Nigeria, establishing 
their schools and bringing with them the ‘western’ style of writing and transcrip¬ 
tion of non-Arabic texts. On the other hand, many Songhay-Zarma speakers came 
to the centres of Islamic learning in Nigeria to study, so the migration was not 
unidirectional. 11 


4.1 Hausa orthographic traits and conventions 

Except for a very small number of publications, the marginal usage of Hausa in 
glosses remains largely unexplored in Ajami studies. In the present paper, we 
are looking at the glosses found in the Arabic manuscripts produced in the 19 th 
century. The results of our research may be summarised as follows: 

1. Vocalisation typical for the Central Sudanic cultural area, i.e. a specific 
vowel-sign for the /e/ vowel; 

2. the ‘emphatic’ Arabic letters are either obsolete or used as symbols for those 
Hausa consonants which do not exist in Arabic; 

3. in a few cases, there is a tendency toward new symbol-creation. However, 
with the exception of the letter ta ’ with three dots, none of these attempts can 
be seen as really successful; 


10 ‘Vor den Fulen besassen die Haussaner eine Schrift, ebenfalls die arabische mit einigen ge- 
ringen Abweichungen, und wenn meine Erkundigungen sich bestatigen sollten, so hatten vor 
den Haussanern die Songhai ihre Muttersprache geschrieben’. (Krause 1884, 29). 

11 On the Zarma in Nigeria see e.g. Dobronravin 2000, 91-101; Gulbi and Bunza 2014. 
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4. the difference between the Western and Eastern conventions is diminishing 
over time, so that the only survival of it now is the difference in the styles of 
writing, but not in the transcription of Hausa phonemes; 

5. Some conventions, or graphic strategies, do exist, but they are fluid, being 
more an approximation to a standard rather than a rigid set of norms. This is 
especially true in the case of length and tone representation, as well as that 
of labialised and palatalised Hausa consonants. 

Each of these tendencies may now be explored in more detail. 


4.2 Vowels in Hausa Ajami 

In this section, we follow Philip Jaggar’s analysis of the Standard Hausa vowel 
inventory. According to Jaggar, 

Hausa has a 10 vowel system, comprising five basic vowels /i, e, a, o, u/ with phonemic 
vowel length, in addition to two diphthongs /ai/ and /au/ [...] In medial position in native 
words, only long /§/ and /o/ occur. If the syllable becomes closed by a coda as a result of a 
morphophonological rule, /§/ and /o/ automatically shorten to /e/ and /o/ and merge with 
centralized /a/ (only short vowels occur in closed CVC syllables). 12 

In the Hausa dialects, the picture is different. Ahmadu Bello Zaria wrote; 

It seems to us quite difficult, if not impossible, to provide a ‘comprehensive’ study of vowel 
differentiation across the numerous Hausa dialects examined. This is because vowel oppo¬ 
sitions may be quite unstable from dialect to dialect, from speaker to speaker and from 
utterance to utterance within the same idiolect. 13 


As for more specific variations, according to Zaria, ‘final /ee/ in the standard 
dialect corresponds to /ii/ in the areas around Zaria and Bauchi. This kind of 
correspondence generally happens after a nasal’ (Zaria 1982, 52). Thus, it is not 
surprising that in many Hausa manuscripts the same word may occur with either 
N or /e/. 

Writings in Sudanic African languages have a strong tendency to retain the 
vowel-signs. This may be explained by the syllable structure of African languages 
in the region. Nonvocalised texts are sometimes almost undecipherable; a Hausa 


12 Jaggar, 2001, 9-11. Vowel length is not marked in modern Latin-script orthography. 

13 Zaria 1982,183. 
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text is not easily understood if the vowel signs are absent in words with a CVCV or 
CVCVCV structure. A few exceptions with nonvocalised Hausa texts are known. 
However, this is highly unusual in the marginal glosses, where the very purpose 
of their usage is the clarification of the main Arabic text. Having studied a great 
number of glossed Arabic manuscripts, we can argue that a Hausa gloss is nor¬ 
mally vocalised. As for the main text, if it is written in Hausa, it can be predicted, 
with a great degree of certainty, that the text will be vocalised. When vowel-signs 
are not marked, it may be the result of hasty (incomplete) writing. Otherwise, 
one could suggest that nonvocalised texts served as written supports for the oral 
transmission of knowledge. In any case, lack of vocalisation is not a rule, and 
various explanations may be found for it. 

With the exception of the diasporas outside Sudanic Africa, the front vowels 
I'll and /e/ are marked with different signs, known respectively as wasali bisa 
(Arabic kasra ) and imala (a loanword from Arabic). As is the case in Kanuri Ajami, 
imala is a dot below a letter borrowed from the Warsh tradition of Qur’anic spel¬ 
ling. By the 19 th century, imala had already been used in Sudanic Africa, in Fula 
and Kanuri written conventions. 

In contrast to the systematic transcription of front vowels, the back vowels 
/o/ and /u/ were usually marked with the same sign, known as rufu’a (in Arabic, 
damma). This is interesting, as the same rule applies to local writings in Fula. 
On the other hand, Western Sudanic literacy in Fula and a few other langua¬ 
ges have an additional symbol for /o/, an inverted rufu’a. The question remains 
as to whether the Central Sudanic usage is archaic, preceding that of Western 
Sudanic Africa, or whether this is a case of the independent development of two 
divergent norms of transcription. In any case, Hausa literacy may be described 
in areal terms as a part of the “non-o” literacies in Sudanic Africa. Although the 
graphic combination of the damma with the letter waw followed by the letter alif 
surmounted by a sukun sign occurs both in Hausa manuscripts and in those of 
the Kanuri/Kanembu area, its use, unlike in Kanembu/Kanuri, is not restricted to 
encoding /o/. It might be argued that this feature was borrowed from the Borno 
tradition, but it became more decorative than graphemic. 

There were a few attempts to create symbols for o in Hausa Ajami in the late 
19 th and early 20 th centuries, but these were apparently rare and definitely unsuc¬ 
cessful. One such example is found in a Hausa manuscript from the Barne coll¬ 
ection at Special Collections, SOAS Library, University of London (MS 380271). 
In this manuscript, the o-sound is rendered with a combination of rufu’a above 
the letter and a small rufu’a below the same letter. This way of writing was cer¬ 
tainly borrowed from the Warsh tradition of Qur'anic reading, but the meaning 
of the combination in the Warsh was different, being a marker of long vowels in 
the affixed personal pronoun -hu. In the 20 th century, a new way of writing the 
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o-sound was invented, most probably in Zaria. As the Hausa o is almost always 
long, that is /o/, the combination used includes a supporting waw with a zero- 
sign (sukun) above it, to differentiate /o/ from /u/, which does not require an 
additional sukun above the waw. However, even now the /o/ is not marked in 
many modern publications in Hausa Ajami. 

An extreme case of fluidity is demonstrated in the marking of vowel-length 
and tones. The latter were most probably ignored or only recognisable in com¬ 
bination with the length of Hausa vowels. It seems that long final vowels were 
left unmarked if they were combined with low tone, as in <doki> (dold) ‘horse’. 
The long vowels were usually marked, as in the Arabic written tradition, with an 
additional waw, yd’ or alif respectively for /u/, /!/ and /a/. In the case of long /e/, 
two variants of symbol combination were initially applied, either an additional 
yd’ or a yd ' without dots (e.g. a stump in the medial position) with a so-called ‘red 
alif above the additional letter. The same ‘red alif could also be used without 
any supporting letter. Nowadays, the standard Ajami rule for /e/ is the use of a 
yd ’ without dots supplemented with a ‘red alif, which is no longer red in colour. 

In a few pre-colonial Hausa manuscripts, usually from present-day Ghana 
and Togo, one more method of length-marking was used before colonisation. 
Thus, a combination of hamza and sukiin could be found above the supporting 
letters (waw, yd’, alif. Adam Mischlich thought this was how the ‘mid-range 
vowels’ were marked. 14 This view has not been proven by any research, and in 
modern Hausa Ajami no such vowels are known. It cannot be excluded that the 
now forgotten combination of hamza and sukun was in fact used to mark a certain 
tonal pattern, but this is far from clear. 15 

Until the 20 th century, the letter alif could also be combined with an additi¬ 
onal diagonal line across the main letter (alif mai suka, ‘alif with spear’). In the 
glosses, ‘alif with spear’ was apparently not used. This variety of alif is uncom¬ 
mon in modern Hausa Ajami. 


4.3 Consonants in Hausa Ajami 

Standard Hausa has 32 consonant phonemes. Among them, the globalised set 
is represented with so-called “hooked” letters in modern Latin-script orthogra¬ 
phy. According to Jaggar, ‘6 and d are laryngealised (often implosive) bilabial and 


14 ‘Hamza da damri (Hamza in Verbindung mit damri) iiber einem Konsonanten deutet an, daK 
die Silbe weder lang noch kurz, also mittellang ist.’ Mischlich 1906, xxxiii. 

15 See also Dobronravin, 2006,139, n. 36. 
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alveolar stops, hooked k is a globalised velar ejective, the digraph ts [s ? ] is an ejec- 
tive alveolar sibilant, and the digraph ’y is a laryngealised palatal glide (derived 
via reduction of a /diy/ sequence)’. Labialised and palatalised consonants are 
written as digraphs. With few exceptions, as noted by Jaggar, ‘the four palatalized 
/fy, ky, gy, ky/ and three labialised /kw, gw, kw/ unit phonemes all contrast with 
the corresponding plain segments before /a(a)/ (/fy/ is a marginal phoneme)’. 
There are also two ‘R-phonemes’, the alveolar tap/roll /r/ and the retroflex native 
flap r [j;], not marked in modern Latin-script orthography (Jaggar 2001, 5-8). 

The picture is different in the Hausa dialects, especially in the Western dialec¬ 
tal cluster and in the diaspora. The globalised consonants in the Western dialects 
also include, e.g. [tf], [s w? ], [b y? ], [b w? ], [d w? ] (Gouffe 1969, Zaria 1982, 50). Most 
of these additional consonants occur and are phonemic only in word-initial posi¬ 
tion before /a/. On the other hand, the consonant [tf] regularly corresponds to the 
standard [s 7 ]; in word-initial position both [tf] and [s 7 ] occur, and the difference 
between them is phonemic. In the Hausa diaspora, there is a tendency towards the 
reduction of the globalised set. In Ghana, the globalised consonants are replaced 
with non-globalised ones, such as /s/ instead of [s 7 ] (Zaria 1982,178). Moreover, 
there are local differences in the treatment of loanwords. According to Zaria, ‘in 
the area around Daura one finds standard /d 7 / corresponding to /s 7 /. This kind of 
correspondence is limited to names which originate from Arabic’ (Zaria, 1982,50). 

Turning to the transcription of Hausa consonants, it may be said that, together 
with the Kanuri writing tradition, Hausa literacy belongs to the ‘g-ghayn area’. It 
means that, with a few exceptions in the diasporas, the consonant /g/ is invaria¬ 
bly written with a ghayn (Hausa angai ). The same graphemic choice is also found 
in Eastern Fula Ajami. To a lesser extent, the g-ghayn area also includes parts 
of the Songhay region and a few languages in the Northern part of present-day 
Ghana such as Mamprule and Dagbane. 16 The g-ghayn area is opposed to the g-kaf 
area in Western Sudanic and North Africa. In the literacies of the latter regions, 
/g/ is represented with the letter leaf, either as such or with three additional dots 
to mark the voiced counterpart to /k/. 

The ‘emphatic’ Arabic letters have been used in different ways throughout 
the history of Hausa literacy. This variety of usage may reflect dialectal variation. 


16 This is witnessed in some manuscripts from the collection of the Institute of African Studies, 
University of Ghana, such as IASAR/28, Abd Allah b. al-Hajj al-Hasan’s Nisab al-dahab , com¬ 
posed in Dagbane in 1361/1942-43 (Hunwick 2003, 597); or IASAR/54, Alfa Muntaga’s poem in 
Mamprule in praise of Shaykh Ahmad al-Tijani ( qasidafi madh li-Shaykh Ahmad al-Tijani). The 
xerox copies of these unpublished manuscripts were consulted by Nikolay Dobronravin in the 
Herskovits Library of African Studies of the Northwestern University, Evanston, USA. 
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In Western Hausa dialects there are two different consonants, /tfV and /s 7 /, both 
corresponding to /s 7 / in modern Standard Hausa and close Hausa dialects (the so- 
called Kano dialect or dialect cluster). As there were two different types of Hausa 
Ajami literacy, one of them was probably influenced by dialectal pronunciation. 

The closest example of a standard can be seen in the use of qaf as opposed 
to the letter kaf By the end of the 19 th century this pair had already been used to 
differentiate two consonants, plosive /k/ and globalised /ft/ in Hausa. 17 However, 
before this standard developed, there existed another set of rules for the transcrip¬ 
tion of Hausa consonants. In earlier manuscripts, both kaf and qaf are invariably 
found to denote both /k/ and /k/. The use of qaf for Hausa /k/ was then retained 
in some manuscripts, but only in loanwords from Arabic, such as <arziqi> (to be 
read as arziki ‘wealth, prosperity’, from Arabic al-rizq, with assimilated article 
al-) or <loqaci> (to be read as lokaci ‘time’, from Arabic al-waqt ). In modern Hausa 
Ajami, the use of qaf is restricted to the transcription of globalised /k/, and the 
historical spelling of Arabic loanwords is normally ignored. 

If the differentiation between kaf and qaf was made relatively early in Hausa 
Ajami, this was not the case of the other ‘emphatic’ letters. The letters dal and 
ta’ are nowadays a regular pair representing respectively plosive /d/ and globa¬ 
lised /d/. In the precolonial Hausa manuscripts and marginal glosses, this oppo¬ 
sition was less common. Both consonants could be written as dal. Comparable 
to the case of kaf and qaf, the choice between dal and ta’ was apparently of little 
importance for the Hausa scribes, as long as they could recognise the words. In 
borrowings from Arabic the use of ta’ was more consistent than in other strata of 
Hausa lexicon. 

The letter ta’ could also be used to denote another globalised Hausa con¬ 
sonant, namely /s 7 /. As mentioned above, in modern Latin orthography this 
phoneme is transcribed with the digraph ts. In Hausa Ajami, there was no visible 
preference for the reading of ta’ as /d/ or /s 7 / before the 20 th century, and it seems 
that such preferences were more individual than regional. Until the 20 th century, 
the letters ta’, za’, sin and ta’ were all used to transcribe the sound which corres¬ 
ponds to globalised /s 7 / in modern standard Hausa. 


17 Such use of qaf for Hausa /k/ in loanwords is found, e.g., in the Hausa manuscript (dated 
1879) written in Ghat, in today’s Libya, where one can see <qabila> (to be read as kabila ‘tribe’, 
from Arabic qabila ) and <yaki> (to be read as yafd ‘war’). The Ghat manuscript does not have a 
special sign for /e/; the final /o/ in verbs is systematically marked with a combination of waw 
and alif with sukun, as in <ya-do>, to be read as ya zo ‘he came’. As a ‘Middle Eastern’ diaspora 
feature, the qaf in this manuscript has two dots above, while fa is written with a dot above or 
below the letter (Ghanim 1998). 
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In the 20 th century, with the standardisation of Hausa in Latin script, glot- 
talised /s 7 / was transcribed with an extra letter, the ta’ with three additional 
dots. Since the 1950s this usage has been strengthened by the publication of 
Boko(Latin)-Ajami transcription tables. 18 The same extra letter is used in modern 
books printed in Ajami in Nigeria. This symbol has found its way into the set of 
additional Arabic symbols in the Unicode of the 2000s. The creation of ta’ with 
three dots is a rare case of a successful introduction of a new symbol in Hausa 
Ajami literacy. Unlike ta’, the Arabic letter za’ (with one diacritical dot) was only 
used for /z/; this letter is fairly rare in modern Hausa Ajami. 

Two centuries of Hausa Ajami development have resulted in the fixation of 
symbols for globalised /k/, /d/ and /s 7 /. The rendering of /6/ and / 7 y/ is another 
story, or perhaps two other stories. In the pre-colonial manuscripts and marginal 
glosses, these two consonants were not marked separately from /b/ and /y/, that 
is, the letters ba’ and ya’ were used to denote them. This lack of differentiation 
may be explained, first of all, by the absence of corresponding ‘emphatic’ letters in 
the Arabic written tradition. No single Arabic letter could be easily selected for the 
transcription of one of these Hausa consonants. Besides, certain dialectal influ¬ 
ence cannot be ruled out. In Western Hausa dialects, / 7 y/ is non-existent, corre¬ 
sponding to the sequence /diy-/, as in the pair cCiyauci, modern standard Hausa 
‘yanci ‘freedom’ and diya (singular feminine), corresponding to modern standard 
Hausa ‘ya ‘daughter; (historically), free woman’; (plural) - ‘ya’ya ‘children’. 

With the development of the Hausa Latin-based orthography, the letters B 
and ’y (a graphical combination of ’ and y) were created, and they are now used 
in most printed Hausa texts. This standard has also influenced Hausa Ajami. As 
a result, globalised /6/ has variously been marked with one, two or even three 
extra dots below the ba’. Today’s prevalent form is a ba ’-shaped letter with three 
dots below. This form is still far from convenient, as /6/ may be found in com¬ 
bination with the vowel /e/, resulting in an awkward combination of three plus 
one dot below the letter. Three dots below were also added to the letter ya’. In 
some manuscripts even today neither /6/ nor / 7 y/ is marked with any additional 
symbols. 

Interestingly, in a few manuscripts and printed publications, the globalised 
/ 7 y/ is rendered with the letter 'ayn and three additional dots above it. This is not 
surprising, if we take into consideration the way the letter ‘ayn has been read 
by most Hausa speakers. In both cases the standard pronunciation is that of a 
glottal stop, marked with a hamza in standard Arabic spelling. Less educated and 
diaspora scribes even wrote ‘Allah’ with an ‘ayn, as <‘alla>. Thus, the rendering of 


18 See, e.g. ‘Ajami Boko’ 1961; Wali/Binji 1969. 
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/ 7 y/ with an ' ayn plus additional dots may be interpreted as the transcription of a 
glottal stop with /y/ as an extra feature. 

Beside the transcription of globalised consonants, the development of a 
Hausa Ajami standard included the transcription of the affricates. In most Hausa 
dialects, these are /dj/ and /tf/, corresponding respectively to the letters/and c 
in modern Hausa Boko spelling. The use of c has long been the norm, but before 
it was developed, and sometimes even nowadays, the digraph ch is also used to 
denote the same Hausa phoneme. 

In Ajami, /dj/ has always been represented with the letter jim. Only in one 
rare diaspora document of 1817 is this letter also used to denote the voiceless 
affricate /tf/, as in <luqaji> lokaci. In the same rare document from Trinidad, the 
phoneme /dj/ is also represented with the letter dal. It is not clear to what extent 
this usage was influenced by the Western Sudanic tradition, where there was a 
more marked confusion in the rendering of affricates. In the great majority of 
Hausa manuscripts and glosses the letters dal and za‘ (also known in Hausa as 
zayra) are used indiscriminately even now. 

As for /tf/, there were initially two major traditions of its transcription. In 
Sokoto-centered manuscript practice, the phoneme was written with the letter 
sin. This usage may have been influenced by the peculiarities of Hausa pro¬ 
nunciation in an environment dominated by Fula speakers. In Kano, as in the 
manuscripts of Borno, the same consonant was written with the letter ta’. This 
usage has been retained and has nowadays become the norm. 

It is worth mentioning that, in Hausa Ajami, the letter ta’ is also found with 
two more variants of reading. In borrowings from Arabic, and especially in perso¬ 
nal names, ta’ may be read as /s/, as in < l utman> (to be read as Usman). In a few 
Hausa manuscripts and marginal glosses in the 19 th century the same letter was 
also used to denote globalised /s 7 /, as in <tdrd> (to be read as tsoro ‘fear’). This 
usage seems to be totally forgotten now. 

A further pair of letters in Hausa Ajami are ra’ and dad. The latter turned out 
to be convenient for representing both /l/ and the retroflex r-phoneme in Hausa 
manuscripts and marginal glosses. The use of dad for /l/ is common in Sudanic 
Africa and may be explained by the same reading in some parts of the Arab world. 
As for the retroflex r, there were a few Hausa manuscripts where this phoneme 
was consistently marked with dad, e.g. sarki ‘king, chief’. In modern Hausa Boko 
spelling there is no difference between the two r-s, and some Hausa dialects do 
not have the retroflex r at all. Thus, in modern Hausa Ajami, the letter ra’ corres¬ 
ponds to both phonemes. As for the letter dad, it is only found in a few loanwords 
from Arabic, mainly in personal names. 

The flexibility of Hausa Ajami standards is still visible in the treatment of 
labialised and palatalised consonants as well as for vowel-length and tone. As 
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mentioned above, in modern Hausa Boko, labialised and palatalised consonants, 
mainly /k'/, /k'/, /g'/, /k 1 /, /ft 1 / and /g 1 /, are marked with the respective digraphs, 
kw, kw, gw, ky, ky and gy. Other cases of labialisation and palatalisation are much 
more rare. There were no ‘spare’ letters in the Arabic script to denote such con¬ 
sonants, and digraphs are not common in the Arabic written tradition. So it is 
understandable that there was little incentive to underline labialisation or pala¬ 
talisation of consonants in pre-colonial Hausa writings. In fact, such consonants 
were marked through the use of ‘similar’ vowel-signs. In most manuscripts and 
marginal glosses, the corresponding signs were rufu’a, for the labialised conso¬ 
nants, as in <gubru> (to be read as gwabro), modern Hausa Boko gwauro ‘bache¬ 
lor’, and imala for the palatalised ones, as in gera (to be read as gyara), modern 
Hausa Boko gyara ‘to fix’. In a single manuscript, already mentioned above ( Litta- 
fin tuba, from Barne’s collection at SOAS), the scribe did try to mark the labialised 
consonants with an additional small rufu’a below the main letter. This attempt 
was, however, unsuccessful, and no other manuscripts in this style have been 
found. In modern Hausa Ajami, influenced by the Hausa Boko standards, both 
labialised and palatalised consonants have been marked. This was done either by 
adding three dots to the original letter ( kaf | qaf ghayn ) or through a combination 
of symbols. In the latter case, the letter waw is supplemented with a madda above 
it to denote labialisation, as in kwacCo ‘frog’. To mark a palatal consonant, the 
letter ya’ with madda is added to it. This approach is now prevalent, but not yet 
uniform in publications printed in Nigeria. 

The use of final ha’ (in some manuscripts both with or without sukun) may 
be influenced by the practices of writing the word final /-a/ in Arabic. This usage 
is also found in modern Ajami publications where the same words occur with or 
without final ha’, like <salla> and <sallah> for salla ‘prayer’. 

The general tendency in the development of Hausa Ajami may be described 
as a slow growth of a set of rules. 19 The trend toward the standardisation of Hausa 
in Arabic script went through significant outside influences in the last century. 
The British in colonial West Africa introduced an additional letter, fa’ with three 
dots below to denote /p/ in ‘penny’. This symbol was only used on coins and 
has never been adopted as a part of Hausa literacy standards. A much more sig- 


19 The topic outside the scope of this chapter is word breaks. In the early stage of Hausa wri¬ 
ting in Arabic script, it was common to see the hyphenation (breaking the word) and disjointed 
writing of single words. The most consistent feature of this style was the disjoined writing of the 
words with a medial long /§/. In such words, the supporting yd’ was left unconnected with the 
next letter of the word, as in <bare wa> ( barewa ), ‘gazelle’. In modern Hausa Ajami such incon¬ 
gruences are very rare. 
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nificant outside initiative took place in the 1980s, when the ISESCO created an 
all-embracing system of Arabic-script transcriptions. The ISESCO model comple¬ 
tely ignored the traditions of Hausa Ajami, e.g. in the use of leaf with additional 
dots for /g/ instead of the common ghayn. As a result, the new transcription was 
practically ignored by the users of Ajami in Nigeria. Elsewhere in West Africa, 
the story was slightly, but not significantly, different. A few Islamic and Chris¬ 
tian publications appeared in the ISESCO orthography, mostly in Fula in (or for) 
Guinea, Mali and Cameroon. The current position of the Unicode consortium 
allows for the further use of Hausa Ajami both offline and on the Internet. 


5 Conclusions 

In this comparative overview of orthographic tendencies in the Kanuri and Hausa 
manuscript cultures we have tried to document ranges of orthographic variation 
and uniformity, identify some of their underlying factors and outline the graphe- 
mic sets particularly prone to standardisation. 

Both cultures had a standard model of the Arabic writing system whose uni¬ 
formity is grounded in Arabic literacy in its Qur’anic and Classical forms. There¬ 
fore, the concept of regulated uniform orthography was not alien to the scribes 
who added annotations in the vernacular to the standardised Arabic texts, often 
written by the same scribe. Nonetheless, the scribes of both cultures (and the 
claim can easily be extended to most of the Arabic-based writings discussed in 
this book) did not show any particular adherence to the principle of total ortho¬ 
graphic uniformity. The only noticeable restriction on variation is manifested in a 
conservative use of the Arabic letters, rather than in particular choices of letters 
and their combinations for representing Kanuri and Hausa sounds. The shared 
disfavour of invented graphemic designs is particularly observable at the earliest 
point in time when both cultures can be synchronically compared, that is, in the 
first part of the 19 th century - the period for which we have the earliest extant 
Hausa manuscripts. 

This conservatism is even more striking for Hausa because, unlike Kanuri 
which has fewer consonantal sounds than the number of consonantal letters in 
Arabic, the consonantal inventory in Hausa is larger than the graphemic inven¬ 
tory in Arabic (28 Arabic characters vs 23-26 consonants in Kanuri vs 32 conso¬ 
nants in Hausa (Newman 2000,392)). 

We cannot ascertain whether it was only pressure from the phonological 
system or some other (extra)linguistic factors which shaped later orthographic 
conventions in Hausa. What is clear in the history of Hausa Ajami writing is that 
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it developed more consonants to better suit its phonological system, and sets of 
standard rules gradually increased with time. 

The economical approach to the use of Arabic letters in the history of Kanuri 
writing is comparable to that of Hausa with the only difference being that, throug¬ 
hout the whole history of Kanuri Ajami writing, no new letters were invented. The 
main reason for that seems to be the smaller number of Kanuri consonantal pho¬ 
nemes. The veneration of the Arabic script - an attitude which cannot be ruled 
out but which is difficult to prove due to a lack of written evidence - may only 
partially explain the uninventiveness of the Kanuri scribes. The same respectful 
attitude to the letters of the Scripture did not inhibit the invention of new charac¬ 
ters in Hausa in later manuscripts, or in many other written traditions of Islamic 
sub-Saharan Africa. 

Some similarities between Kanuri and Hausa Ajami in their graphemic 
choices for writing certain sounds visibly identify both cultures as belonging to 
the same larger literacy area. Thus, both use <gh> for /g/, unlike the manuscript 
cultures to the west (for example Soninke) where the most common choice for 
/g/ is <k>. We call the former the ‘g-ghayri type of orthography as opposed to the 
‘g-kaf type. Another feature setting Kanuri and Hausa apart from the western 
cultures is the absence of tanwin signs used to indicate a word-final sequence of 
a vowel and a nasal consonant, such as -an, which, in Kanuri and Hausa writing, 
would be spelled with the diacritic sign fatha for [a] and the letter nun for [n]. 
Since we do not have manuscript evidence of Hausa writing before the early 19 th 
century, it is impossible to say whether Hausa had a different ‘western’ selection 
of orthographic choices before having closer contact with Kanuri. 

Apart from the similarities, our comparative analysis reveals some contras¬ 
ting features summarised as follows. One of the most striking differences between 
Kanuri and Hausa orthographies is the encoding of /o/. For Kanuri, a special spel¬ 
ling of the final /o/ is already attested in the early Old Kanembu manuscripts 
which is a combination of the damma + waw + alif surmounted by a sukun sign. 
Contrastively, the earliest known Hausa Ajami writing belonged to the ‘non-o’ 
type of orthography and specific spelling solutions for /o/ were only sporadi¬ 
cally tried by the scribes. In view of the fact that Hausa manuscript culture was 
significantly influenced by that of the Kanuri (the similar script style being the 
most salient feature), the reason why it was not until the 20 th century that Hausa 
scribes introduced a graphemic combination for /o/ similar to that used in the 
Kanuri manuscripts still needs to be explained. 

Both cultures have identifiable specific sets of grapheme-phoneme corres¬ 
pondences, which changed over time. From the diachronic perspective, such sets 
can be considered stable since it was only the mapping of a limited number of 
graphemes on a limited number of phonemes that changed. 
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In Old Kanembu writing, the phoneme /s/ was written as sin, then as sin and 
ta’ and then as sin, ta’ and sin, and later only as sin. The additional letters were 
also used for homorganically similar phonemes. Thus, the letter ta’ was also 
used for /ts/ and in the later manuscripts for /tf/, while the letter sin was also 
used for /§/. What is important here is that this graphemic-phonemic subset was 
quite stable over time within a given number of available choices, with only the 
mapping changing. Another such stable set with a restricted range of graphemic 
choices is observable for the phoneme /z/ which was written with the letters za\ 
dal and zal, or for the phoneme /k/ written as qaf ~ kaf. 

In Hausa Ajami, the salient stable sets can be identified as follows. At diffe¬ 
rent times and in different areas, the phoneme /tf/ was written with the letters sin 
(in earlier manuscripts in the area of Sokoto) and ta’ (in Kano manuscripts and 
later everywhere). The letter ta’ was also used for homorganically similar /s/ in 
Arabic loanwords which originally have /t/ (and are written in Arabic with ta’) 
and for the globalised /s ? /. Another set consists of the letters ra’ and dad and the 
phonemes /l/ and retroflex /[/. In earlier times, dad was used for /l/ and occasio¬ 
nally for hi. In modern Hausa Ajami, both rhotics are written with the letter ra’, 
whereas dad is only used in the etymological spelling of Arabic loanwords. 

Thus, we can identify sets of grapheme-phoneme combinations which were 
stable within a restricted range of the phonemic and graphemic inventories. The 
choices within such sets were conditioned by phonological, morphological, and 
mimetic factors, the latter being the etymological spelling of Arabic loan words 
and shared scribal practices manifested on the pages of the same manuscript. 

In more general historical terms, Hausa writing shows a tendency towards 
a closer match between the number of phonemes and graphemes, especially in 
more recent manuscripts. In the history of Kanuri writing, there is also a notice¬ 
able change in the choice of Arabic letters but there are two parallel tendencies. 
One is the diachronic rearrangement of grapheme-phoneme correspondences 
within specific grapheme-phoneme sets, while the other is the retention of the 
spelling of some high-frequency lexical and grammatical items. 
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Lameen Souag 

Kabyle in Arabic Script: A History without 
Standardisation 

Abstract: The history of writing Kabyle, the largest Berber variety of Algeria, is 
often reduced simply to the history of the development of its currently dominant 
Latin orthography; Kabyle before the 20 th century is often viewed as essentially an 
unwritten language. This oversimplification obscures an extensive, though some¬ 
what peripheral, corpus of Kabyle written in Arabic script starting before the 19 th 
century and continuing in some contexts up to the present. Unlike Berber varieties 
in some other regions, however, Kabyle shows few signs of ever having developed a 
standard Arabic-script orthography, and the transcription conventions used show 
little or no continuity between manuscripts from different periods. This lack of stan¬ 
dardisation reflects discontinuities in the region’s educational and political history, 
in combination with certain differences between Kabyle and Arabic phonology. 


1 Introduction 

As might be expected from Arabic’s long history as the dominant literary language 
of North Africa, Berber languages have frequently been written in Arabic script. 
In at least two regions during the pre-colonial era, such writing went beyond the 
sporadic quoting of words or poems and came to be used for a significant corpus 
of largely religiously motivated manuscripts: the Sous Valley of Morocco in the 
west, and the Ibadi areas of Libya and southern Tunisia in the east (Boogert 1997; 
Brugnatelli 2011). This contrasts strikingly with the most populous Berber-spea¬ 
king region between them, Kabylie in north-central Algeria. Relatively few Berber 
manuscripts in Arabic script have been reported from the area, even though it 
is known to have contained many zaouias (religious colleges) with substantial 
manuscript collections, and the colonial-era research tradition that provided 
Western academics with their first glimpse of the Sous and Ibadi manuscript tra¬ 
ditions (Luciani 1893; Calassanti-Motylinski 1905) treated Kabyle as essentially 
an unwritten language. In present-day Kabylie too, there is little awareness of 
any history of writing Kabyle in Arabic script; Haddadou (2004), of the Univer¬ 
sity of Tizi-Ouzou, devotes a whole section of his L’alphabet berbere to the Arabic 
alphabet, alluding to Moroccan, Libyan, and Mozabite examples, yet mentions 
no examples of its use for Kabyle. 


3 Open Access. © 2019 Lameen Souag, published by De Gruytei. |E223H This work is licensed 
under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
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Contrary to the impressions of colonial-era researchers and modern Algeri¬ 
ans alike, however, several Kabyle manuscripts in the Arabic script survive. They 
are strikingly similar in their content and purposes to the better-known Sous and 
Ibadi materials, although rather less numerous. Moreover, the practice of writing 
Kabyle using Arabic script has continued in some contexts even up to the present 
day, despite being rejected by most activists. Nevertheless, a close examination 
of the orthographic conventions used reveals sharp discontinuities. While indivi¬ 
dual materials produced in similar circumstances show some signs of standardi¬ 
sation, such embryonic conventions have rarely been maintained for long. This 
lack of standardisation, and the rarity of Kabyle manuscripts in Arabic script rela¬ 
tive even to other Berber varieties, seem to be the result of two factors: the sharp 
discontinuities in the region’s educational and political history over the past two 
centuries, and the specific differences between Arabic and Kabyle phonology. 


2 Contexts of literacy 

By the start of the 19 th century, the region of Kabylie, largely independent of the 
nominally Ottoman central government in Algiers, was covered by an extensive 
network of zaouias (timeemmert in Kabyle) - religious colleges providing a brief 
primary education to large numbers of children, and a higher education to the 
minority who stayed longer. Their curriculum included literacy, memorisation 
of the Qur’an, Arabic grammar, Sufism, Maliki law, hadith, interpretation of the 
Qur’an, arithmetic, geometry, astronomy, and poetry (Daumas 1847, 61-63). Col¬ 
loquial languages were of little concern to this educational system, whose core 
texts were in Classical Arabic and whose scholars in turn normally wrote in Clas¬ 
sical Arabic. However, an important function of graduates was to provide reli¬ 
gious guidance in Kabyle to non-Arabic-speaking lay people, and in some cases 
this motivated them to write down Kabyle texts of their own composition. 

The French conquest, starting in 1830 with the occupation of Algiers but not 
encompassing the Kabyle heartland until 1857, had a disruptive effect on this 
system from the start. Parts of the elite fled to Syria, preferring exile to French 
rule, and villages and mosques were destroyed in the process of conquest (cf. 
Hanoteau 1867, 144). In order to govern the newly conquered areas, the French 
state found it necessary to study Kabyle, resulting in a flurry of publications 
incorporating Kabyle text to varying degrees; these were not, however, aimed at 
a Kabyle audience. 

In 1871, Shaykh El-Mokrani and Shaykh Aheddad led a major rebellion against 
France, which at its peak involved not only their native region, eastern Kabylie, but 
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most of central and eastern Algeria. When France finally defeated them in 1872, 
the consequences for the region were dire. A heavy punitive tax was imposed, 
beggaring many families, and lands owned by tribes and foundations - including 
zaouias - were confiscated by the state. Leading rebels, including major religious 
scholars, were exiled or executed, and restrictions were imposed on Islamic edu¬ 
cation. In such a situation, the resources available to the zaouias, and the pros¬ 
pects for their graduates, were a fraction of what they had formerly been. 

Meanwhile, a new educational system was beginning to provide more attrac¬ 
tive prospects. Between 1873 and 1883, the Peres Blancs opened eight Catholic 
schools in Kabylie; the French government followed suit in 1882, opening another 
eight state schools (Direche 2007). In these schools, the language of instruction, 
and more generally of literacy, was French; together with their modern vocatio¬ 
nal focus, this qualified their graduates for subaltern positions within the colo¬ 
nial economy. These schools were attended by only a small minority, but their 
graduates’ relative prosperity gave them an importance out of proportion to 
their numbers. Starting in the early 20 th century, the importance of French was 
increased by labour migration to France, as remittances became a major eco¬ 
nomic resource for Kabyie villages. Somewhat later, the Association of Muslim 
Ulama, founded by Ben Badis in 1931, attempted to redress the balance by expan¬ 
ding Islamic education and shifting it towards a modernised curriculum taught 
in Arabic (Heggoy 1973). 

In 1948, the Algerian literacy rate, excluding French citizens, was only 6% 
(United Nations 1955, 436). It was left to the War of Independence (1954-1962) 
to change this situation. The outbreak of the revolution was itself enough to 
provoke belated reforms; with the Constantine Plan of 1958, France attempted 
to appease and remould Algerian public opinion by, among other things, buil¬ 
ding new schools and increasing enrolment. After independence in 1962, the 
pace of change accelerated: the new Algerian government extended education 
throughout the country, making literacy one of its highest priorities. The lan¬ 
guage of instruction was initially usually French, since, at independence, most of 
the small minority of graduates in subjects other than Arabic and Islamic studies 
had studied in French and had little command of Standard Arabic. However, the 
government pushed hard to change the language of instruction to Arabic, despite 
opposition from some of those educated in French. By the end of the 1980s, all 
levels up to the end of secondary school, as well as many university subjects, 
were taught in Arabic. 

Berber, however, was treated rather differently. The post-independence 
government viewed the idea of Berber identity as divisive, and saw no place for 
the language in education; the only appropriate arenas for its public use were oral 
contexts such as radio broadcasting or music, and even there it was put under 
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pressure. In combination with other factors, this helped provoke increasing hos¬ 
tility in Kabylie to the government’s education policy; one of the era’s first large- 
scale protests, in 1980, was triggered by the government’s banning a university 
lecture on Berber poetry. Massive emigration from rural areas to the largely Arabic¬ 
speaking cities throughout this period led to extensive language shift among emig¬ 
rants’ children, compounding the feeling that Kabyle was under threat. 

In 1988, the government was forced by budgetary problems and popular 
protests to open up the political arena to independent parties and newspapers, 
creating new options for the use of Kabyle in writing. In 1995, after a large-scale 
school boycott in Kabylie compounded the multiple crises then facing the state, 
it conceded the gradual introduction of Tamazight teaching into selected schools 
(Zeroual 1995). Over the early 21 st century, it went further, declaring Tamazight 
first a national language (Bouteflika 2002), and finally an official one (Bouteflika 
2016). In theory, ‘Tamazight’ refers to a language of which all Berber varieties are 
considered to be dialects. In practice, the Tamazight of most government text¬ 
books and of books by Kabyle speakers - which make up the large majority of 
Berber publishing in Algeria - is simply Kabyle with a number of Arabic or French 
loans replaced by neologisms. Even now, Tamazight occupies a very limited place 
in the educational system; it is taught as an optional subject, not available in 
all schools, and has not been adopted as a language of instruction for any other 
subject. However, Tamazight-language publishing, made possible by the post- 
1988 liberalisation, continues. 


3 Sources 

No list of Kabyle works in the Arabic script can be guaranteed or even expected 
to be complete. Many potentially relevant collections remain uncatalogued, and 
the observed domains of Arabic script usage include contexts, such as private 
letters or political posters, which are rarely preserved systematically. Neverthel¬ 
ess, enough material is available from the past two or three centuries to yield a 
general picture. The distribution of this material can only be understood in the 
light of its educational and political contexts. 
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3.1 Indigenous efforts in the pre-colonial and early colonial 
periods 

3.1.1 The zaouia tradition 

The manuscript libraries of the zaouias of eastern (‘Petite’) Kabylie have yielded 
a small but significant number of manuscripts in Kabyle. None include dates, 
unfortunately; some can be shown to date to the 19 th century, while others are 
thought to predate this period, but in most cases no solid evidence is avail¬ 
able. The best documented ones are the religiously focused materials from the 
collection of Shaykh Lmuhub Ulahbib of Tala Uzrar (near Bejaia), born in 1822 
(A'issani 1998), some of which may predate the 19 th century. 476 manuscripts in 
this collection have been catalogued (Mechehed 2007), although this does not 
reflect the full picture, since some manuscripts have passed into other hands, 
and many were burned by French troops in 1957. Most of this collection is in 
Arabic, but it includes five Kabyle manuscripts: three copies of translations of 
al-Sanusi’s Creed (KA 21, KA 22, TIA 08), of which the oldest (KA 21, Fig. 1) has 
been addressed in detail by Gutova (2011); an anonymous poem about the virtue 
of knowledge by Said Ali Cherif, born 1820 (LIT 21/2); and a child’s note (DVS 09). 
Luciani (1893, 159) attests to the existence at his time of a Kabyle translation of 
the same work at the zaouia of Sidi Yahia ben Hamoudi, among the Ait Ourthi- 
lane (which he describes as the only example of a Berber Islamic manuscript in 
Algeria). GEHIMAB (2006,5) mentions another Kabyle creed in manuscript found 
at the zaouia of Tilmiouine (Ouzellaguen), and the Ministry of Religious Affairs 
(2009,24) gives a barely legible photograph of another theological manuscript in 
Kabyle, cited as belonging to the family of Shaykh Oubelkassem of the zaouia at 
Boudjelil (bombed by the French army in 1956). Another religious poem, from the 
library of Shaykh Aheddad, is mentioned in Ai'ssani (2012). Manuscript no. 3066 
of the Bibliotheque Nationale d’Algerie is described by GEHIMAB (2006, 2) as a 
10-page verse text on fiqh in the Kabyle dialect of the Soummam valley; the photo 
is illegible. 

A rather different genre, equally motivated by the needs of zaouia students, 
is represented by the Algerian Arabic-Kabyle wordlist from the library of Cheikh 
Aheddad, partly reproduced by GEHIMAB (2006,5) and transcribed and analysed 
by A'issani (2012). Zaouias often attracted students from outside the region, who 
would have needed to learn Kabyle for practical purposes. 

Many early North African Arabic-language texts, such as the early 18 th century 
pharmacopoeia of Ibn Hamadush (2001, 57, 83), contain scattered words in Berber, 
commonly proper names or plant names. While such transcriptions are of limited 
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Fig. V. First page of manuscript KA 21 from the Lmuhub Ulahbib Collection (Taia Uzrar, Ain- 
Legradj, Bejaia, Algeria), a Kabyle translation of al-SanusT‘s creed. © Djamel-Eddine Mechehed. 
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value, they too may serve as evidence for the development of orthographic conven¬ 
tions. The Voyage of al-Wartilani (1994), in which an 18 th century Kabyie religious 
scholar recounts his travels, is of particular interest in that its original form presuma¬ 
bly reflected Kabyie zaouias’ practices; however, a study of the author’s orthographic 
choices would have to be based on manuscripts rather than on later printed versions. 


3.1.2 Customary law codes 

At least since the Ottoman period, some Kabyie villages created manuscripts summa¬ 
rising their customary law, or qanun, a phenomenon recently surveyed by Gahlouz 
(2011). These were most often written down in Arabic, e.g. the qanun of the Maatka 
copied in 1859 (Millot 1922), or the pre-colonial qanun of the Ait Hichem dating to the 
1830s (Millot 1928). Occasionally, however, they were written in Kabyie; the present 
survey examines the late 19 th century qanun of the Ait Ali Ouharzoune scanned by 
Brugnatelli (2006) and partly translated (Anonymous 2009). Another reported 
example is that of the Ait Ouagennoun (Bernard and Millot 1933; Ould-Braham 1986). 
Unlike the zaouia manuscripts, these cases come from western (‘Grande’) Kabylie. 


3.2 Colonial efforts 

3.2.1 Language teaching materials 

Given that Europeans and Americans are typically much more familiar with the 
Latin alphabet than with the Arabic one, it may appear paradoxical that text¬ 
books and articles intended for their benefit made frequent use of the Arabic 
script. This fact, however, is important for the light it sheds on contemporary 
Kabyie practice. The earliest cases justify this practice on the basis of the spea¬ 
kers’ own habits; thus Hodgson (1834, 38) writes somewhat paradoxically ‘The 
Berber language, having no literature, has properly no alphabet of its own. It is 
written nevertheless by means of the Arabic characters, which are generally used 
for that purpose’, while the dictionary of Brosselard (1844, II) - produced by a 
committee including one Kabyie, Sidi Ahmed ben el Hadj Ali of Bejaia - describes 
the Arabic orthography used as ‘the system of transcription that appears most 
widely accepted’, calling it ‘the Berber alphabet’ (author’s translations, here and 
henceforth). Later works are more dismissive, describing Kabyie as an unwritten 
language (Delaporte 1836, 89; Ben Sedira 1887, III; Boulifa 1897, II), and presen¬ 
ting the writing of Kabyie in Arabic script as an occasional makeshift of the mara¬ 
bout class rather than a well-rooted practice. General Hanoteau’s (1858) influen- 
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tial grammar of Kabyle did much to popularise this view: he states that ‘[t]hose 
who speak it borrow its characters from the Arabic alphabet, when they want to 
express their ideas in their language in writing, which they moreover do only very 
rarely, and always with a certain distaste. Kabyle, they say, is spoken, not written’ 
(Hanoteau 1858, 1-2), and later added for good measure that 'All efforts made 
to find a book written in this language have been, and no doubt always will be, 
fruitless’ (Hanoteau 1867, II). 

Nevertheless, Hanoteau continued to include Arabic transcriptions in parts of 
his works, justifying them on the grounds that they make it possible for the learner 
to get a speaker of Berber to read the words written, and thus to correct the learner’s 
pronunciation: ‘This transcription, nonetheless, will not be lacking in utility for 
people who want to study the Kabyle language; it will allow them, in effect, to get 
the texts read for them by a Kabyle’ (Hanoteau 1867, XII). The practicality of this solu¬ 
tion is confirmed by other data. Rinn (1887) presents two Kabyle poems attacking 
Cheikh Aheddad, anonymously sent in Arabic script to a French general in 1872; 
these suggest that, at that period, even the small minority of Kabyle speakers who 
wanted to be seen as vocally pro-French were more accustomed to Arabic script. 

The prolific output of Rene Basset would keep the practice of Arabic 
transcription going for some decades longer. In his Manuel de langue kabyle 
(1887), as throughout his extensive publications on Berber, he used Arabic cha¬ 
racters alongside Latin transcription, on the grounds that (p. VIII) ‘[T]he natives, 
[wjhatever one may do, will not abandon the use of the Oriental characters’, and 
that it allows problematic short vowels to be ignored in writing. In the meantime, 
however, this justification was becoming unnecessary: the demands of French 
administration were creating a minority of Kabyles literate in French. 

For this new group, uncomfortably poised between the indigenous and French 
societies, the use of Arabic script could no longer be taken for granted. In a short 
grammar, Ben Khouas (1881, 5-6), presents a paragraph-long argument to con¬ 
clude that ‘[sjince [...] the Kabyles have much more frequent relations with the 
French than with the Arabs, especially when there is occasion to write something, 
it is preferable in every way, for writing Kabyle, to use French characters’. Text¬ 
books by others in this situation at the same period, such as Ben Sedira (1887) and 
Boulifa (1897), dispense with Arabic characters entirely, or, like Boulifa (1913,351), 
confine them to a single page by way of preparation for examinations. The same 
trend is observed in contemporary French linguists’ work; Moulieras (1892; 1893) 
uses Latin characters exclusively to transcribe the Kabyle stories dictated to him by 
informants he describes as illiterate, as does Gourliau (1893), while Luciani (1899, 
33) publishes only the Latin transcription of poems he had actually received in 
Arabic transcription. In 20 th century French-language works on Kabyle, whether by 
Frenchmen or by Algerians, Arabic characters are almost uniformly absent. 
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3.2.2 Missionary materials 

Insofar as Christian missionaries’ choice of orthography reflects their perception 
of existing usage, it casts light on Kabyle orthographic preferences in a similar 
way to language learning materials. However, sectarian factors also appear 
to have been relevant; Catholic sources consistently avoid Arabic script in this 
region, while Protestant ones show greater variability. 

The earliest attempts to render parts of the Bible into Kabyle, from Arabic, 
took place at the behest of the American vice-consul Hodgson. The (Muslim) 
translator he commissioned, the marabout ‘Si Hamet’ of Bejaia, wrote his transla¬ 
tion in the Arabic script. Parts of these (Anonymous 1833) were printed in a modi¬ 
fied Arabic script and distributed in Algeria shortly after the French conquest, 
while another portion was later printed for purely linguistic uses (Newman 1847). 

Serious efforts at Christian proselytisation in Algeria started a generation 
later, in 1868, when the newly appointed Catholic Archbishop of Algiers, Cardinal 
Lavigerie, founded the Peres Blancs; starting in 1873, they also established village 
schools in Kabylie (Direche 2007). From the start, their efforts, liturgical (Alle- 
mand-Lavigerie 1868; 1869) or lexicographic (Creusat 1873; Olivier 1878; Huyghe 
1901), made exclusive use of Latin characters, again characterising Kabyle as an 
unwritten language. 

Protestant evangelisation - initially based at Djemaa Saharidj near Tizi-Ouzou 
- largely followed the Catholic lead in this respect, justifying the choice on the 
grounds that French schools were in any case spreading literacy in the Latin script 
by then (Rutherfurd and Glenny 1900,146; Cuendet, Hocart and Lamb 1893). Never¬ 
theless, they hedged their bets with a supplementary printing of Mark and Luke 
in Arabic script (British and Foreign Bible Society 1904), specifically intended for 
students from traditional Islamic schools (Canton 1904,16): ‘Stirred by the energy 
of western life, the marabouts taught the Kabyle lads the Arabic character, and for 
these a transliteration of St Luke was in progress’. This would be the last missionary 
publication in Kabyle using Arabic script, as far as I have been able to find. 


3.3 The early 20 th century 

3.3.1 Poetry 

One of the earliest and most important printed works by a Kabyle author aimed 
partly at a Kabyle audience, Boulifa’s (1904) anthology of Kabyle poetry, uses 
both Latin and Arabic characters. Since the author’s pedagogical works intended 
for Frenchmen do not include Arabic script, this choice was most likely intended 
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to make the poems legible to a wider audience of literate Kabyles. Yet the expan¬ 
sion of French education in the region starting from 1880 (Direche 2007), along 
with the closing of zaouias, would leave the Arabic characters meaningless to 
later generations. Mouloud Feraoun (1960,10-11) bears eloquent witness at once 
to Boulifa’s success in appealing to a Kabyle audience and to the change in edu¬ 
cational patterns that made this practice obsolete: 

They are incapable of reading the poems gathered by Boulifa in the Arabic transcription; 
if they tackle the French transcription, they can be sure of reading them well only when 
it comes to a poem that they already know [...] It is “the Book”, the only book of young 
Kabyles. You find it in the villages, old and venerable, as it was published more than half a 
century ago [...] It is never complete; a number of its missing pages have been offered to a 
friend, to continue their clandestine career in others’ hands. 1 

The use of Arabic script would still be continued, however, by those Kabyles who 
opted for a traditional or reformist (Ulama) education. Yacine (1987) presents an 
extensive corpus of reformist Islamic and nationalist poetry in Kabyle by Qasi 
Udifella, originally written down in vocalised Arabic letters by Hadj Boubekar in 
the 1940s in the Biban region in the extreme east of Kabylie. The choice of Arabic 
characters was natural, given their political alignment. 


3.3.2 Popular music 

In the 1950s and 1960s, Kabyle song titles (and sometimes lyrics) could often 
be found in Arabic script, as can readily be seen by examining the LPs of Cherif 
Kheddam, Slimane Azem, or El Hasnaoui. In fact, Cherif Kheddam, educated at 
the zaouia of Boudjelil in eastern Kabylie, often wrote his own lyrics in Arabic 
script (GEHIMAB 2006). This minor domain of Arabic-script Kabyle becomes less 
prominent in the 1970s (Idir’s albums, to take an influential example, simply use 
Latin script), and later vanished almost completely. 


1 Les poemes recueillis par Boulifa, ils sont incapables de les lire dans la transcription arabe ; 
s’ils s’attaquent a la transcription franchise, ils ne sont surs de bien lire que lorsqu’il s’agit d’un 
poeme qu’ils connaissent deja [...] II est “le Livre”, l’unique livre des jeunes Kabyles. On le trouve 
dans les villages, vieilli et venerable, tel qu’il fut edite, il y a plus d’un demi-siecle [...] II n’est 
jamais complet: nombre de ses feuillets manquants ont ete offerts a un ami pour poursuivre en 
d’autre mains leur carriere clandestine. 



Kabyle in Arabic Script: A History without Standardisation 


283 


3.4 Official recognition 

The post-independence Amazigh identity movement, reacting against the uniform 
Arab identity urged on them by the Algerian state, conspicuously avoided the use 
of Arabic script. The Neo-Tifinagh script proposed by the Academie Berbere quickly 
attained emblematic status for signs and mottos, while the Latin orthography later 
popularised by Mouloud Mammeri (1976) came to be generally adopted for longer 
works. However, political liberalisation following 1988 created arenas outside 
of Amazigh activism for the written use of Kabyle; already during the 1989-1990 
election campaigns some parties created posters with slogans in Kabyle in Arabic 
script (not examined here). Soon afterwards, the Algerian state, facing intense pres¬ 
sure on multiple fronts, offered the Amazigh activists a limited compromise. Since 
1995, the Algerian state has provided limited teaching of Tamazight (conceived as a 
single language) in selected schools; since 2002, Tamazight has been recognised as 
a national language, and in 2016 it was declared an official one as well. 


3.4.1 Educational materials 

The decision to teach Tamazight required the creation of textbooks and dictio¬ 
naries acceptable to the state. As such, it brought previously unconcerned parties 
into the orthography debate. While the use of Arabic script was by and large 
strongly opposed by Amazigh activists, particularly in Kabylie, it was strongly 
favoured by significant segments of the state and of Algerian public opinion 
outside the region. The dictionary of Tagamount (1995), published just in time for 
this decision, reflects the debate: while primarily using Latin characters, it pro¬ 
vides an Arabic transcription in the Kabyle-French/Arabic section, alongside the 
Arabic translations. This grudging concession to Arabic would not be followed 
up in subsequent dictionaries, however: while Kabyle dictionaries have since 
become a routine sight in Algerian bookstores, even those that give Arabic trans¬ 
lations almost always avoid giving Arabic transcriptions. 

Notwithstanding the widespread preference for Latin script (Sini 2004), 
Kabyle supporters of the Arabic script can still be found. A notable example 
is Salah Belaid, of the University of Tizi-Ouzou. He devotes nearly half a book 
(Belaid 1999) to the issue of which writing system to choose for Berber, coming 
down strongly in favour of Arabic. By way of examples, he gives a text transcribed 
into Tifinagh, Latin, and Arabic characters (159-166) and a glossary of neologisms 
drawing extensively on previous authors’ work (215ff). Like most other writers on 
the subject of which script to choose for Kabyle, however, he does not appear 
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to have published any books in Kabyle, so there is little incentive for would-be 
readers to learn his orthographies. 

Up to the present, the disagreement over script choice has been resolved by 
making official materials for the study of Berber available in three scripts: Latin, 
Arabic, and Tifinagh. In practice, Latin script overwhelmingly predominates in 
Kabylie, but official examples of the use of Arabic script can still be seen, e.g. 
Ministry of Education (2003), Hrouch et al. (2010). As will be seen below, the 
Arabic orthography used in these textbooks, described in detail by Gaci (2011), is 
rather eccentric, being chosen basically to maximise compatibility with the Latin 
transcription, and does not correspond to any of the practices seen elsewhere. 


3.4.2 Religious materials 

The post-1995 climate of relative openness to Tamazight affected not only the edu¬ 
cational system, but also a domain whose control is almost equally important to the 
state: Islamic religious publications. The preference for Arabic orthography was, 
unsurprisingly, particularly strong in this domain - although even here, non-state- 
funded publications have used Latin script. A group led by the Kabyle religious 
scholar Si Mohand Tayeb has published first selections from the Qur’an (2003) and 
later a full translation (2012), printed in Saudi Arabia with funding from Algeria’s 
Ministry of Religious Affairs. In this work the Arabic script is used, with some addi¬ 
tional characters. The decision to write in Arabic script attracted criticism from 
Kabyle activists, but was vigorously defended by the author (as quoted in Chachoua 
2010): ‘My principal long-term goal is to defend Berber against those who want to 
turn it away and distance it from Islam. [...] I have often, in many published articles 
and letters to various political authorities, called for the adoption of Arabic letters 
for writing Berber, and I would do the impossible to make this happen’. Rebahi 
(2009), in a booklet funded by the same ministry, gives two short religious poems 
in Kabyle, written in Arabic script, with translations. 


3.4.3 State media 

A national radio channel using Tamazight (mainly but not exclusively Kabyle), 
Radio 2, has been maintained since independence. The question of script choice 
hardly arises for broadcasting, but since at least 2011 this channel has also had a 
webpage (Radio 2) at the national radio service’s website. At no point has this page 
been translated into Kabyle or any other Tamazight variety. From 2011 to 2013, it 
was bilingual in Arabic and French, with Kabyle programme titles and headings 
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transcribed in Arabic script on the Arabic site and in Latin script on the French 
site. By 2014, this was no longer the case: since then the Radio 2 page has been 
available only in French, not in Arabic, and the programme titles and headings 
(still in Kabyle) are transcribed in Latin script (and later also Tifinagh). The only 
remaining trace of Arabic script in 2014 was the station motto at the top of the page, 
transcribed in Tifinagh, Arabic script, and Latin script - and by 2017 this had been 
replaced by the name of the station in Tifinagh script plus translations into French 
and Arabic. More recently, a Tamazight TV channel (TV 4) has been created; this 
station too makes frequent use of Arabic script to transcribe Tamazight titles or 
slogans, including Kabyle ones, but the texts in question remain very short. 

In 2013, the state-owned primarily Arabic-language newspaper Ech Chaab, 
founded in 1962, started carrying columns in Tamazight written in Arabic script 
(Rubrique Radar 2013). Some, but not all, of these columns were in Kabyle, e.g. 
Lakhdari (2013). 


3.4.4 Books on heritage 

The most conspicuous contributor in this domain is the historian and ex-FFS 
deputy Mohamed Arezki Ferad. His history of his native Azeffoun (2003), pri¬ 
marily written in Arabic, includes many Kabyle poems and sayings written in 
Arabic script. In his study of Tipasa Berber proverbs (Ferad 2004), an attempt to 
document the oral traditions of a Berber dialect falling outside the Kabyle conti¬ 
nuum proper, he acknowledges that the Latin script has taken the lead at present, 
but vigorously defends his choice of the Arabic script, arguing that ‘in view of 
the richness of the Berber cultural stock recorded in Arabic characters (books, 
contracts...) and the cultural factors that have brought together the Berber and 
Arabic languages, the Arabic script is a candidate for filling the principal role in 
the advancement and development of Berber in the future’ (63). The author has 
gone on to produce other books focused on Kabyle heritage, including an over¬ 
view of the region (Ferad 2007) and a study of Kabyle Sufi poetry (Ferad 2011). 
More academically oriented Arabic-medium studies of Kabyle folklore also some¬ 
times make use of Arabic script, such as the thesis of Tabarkan (2012). 


4 Orthographic conventions and their contexts 

The number of texts written in Kabyle in the Arabic script, while not enormous, 
is sufficiently large to raise the question of standardisation. How have different 
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authors dealt with the problem of representing phonemes with no direct Arabic 
equivalents? To what extent have they been influenced by one another’s conven¬ 
tions, or more generally adopted similar conventions? When and in what milieux 
have standards, if any, emerged, and how long have they been maintained? 


4.1 Challenges for orthography design 

All of the phonemes of Classical Arabic as pronounced in Algeria are to be found 
in Kabyle except the stop d (in eastern varieties d is also missing). Since Arabic d 
is usually pronounced d in the region except in careful speech, this gives authors 
a choice between d and d to represent the phoneme d. Kabyle, however, also has a 
number of consonant phonemes lacking in Classical Arabic, most conspicuously 
g and z (shared with Algerian Arabic). It distinguishes affricates c, g (mainly in 
geminates) from fricatives s, z (conventionally transcribed c, j), while Arabic does 
not (depending on the underlying dialect, Arabic gim is regionally realised either 
as g or as z) . It also marginally distinguishes affricates ts, dz (the former is usually 
derived from underlying geminate ft, while the latter is very rare) from both t, d 
and s, z, while Arabic has no phonemic affricates. It has a broader distinction, 
usually predictable but contrastive in some contexts, between stops ( b, t, d, k, g) 
and spirants ( b, t, d, k, g); of the spirants, only t and d have Arabic counterparts. 
The distinction between plain l, r and emphatic l, r is marginally contrastive both 
in Kabyle and in Algerian Arabic (for l it is arguably contrastive even for Classical 
Arabic), but is consistently not made in Arabic script; that between s and s is even 
more marginal in Kabyle, and absent in Arabic. Kabyle also has a series of labio- 
velarised back consonants {k w , g w , x w , y w ), with no direct equivalent in Arabic, 
but often corresponding in loanwords to Arabic short u. Of these, note that the 
affricates ts, dz and the spirant series are not phonemic in Shilha and in Libya, 
simplifying the situation faced by would-be writers there. 

The vowel inventory of Kabyle is rather smaller than that of Classical Arabic; 
whereas Arabic distinguishes short a, i, u from long a, f, u, Kabyle contrasts one 
lax vowel, a (conventionally transcribed e), with three tense ones, a, i, u. (In 
Shilha, the lax vowel no longer has any phonemic status.) Mapping the Kabyle 
system onto the Arabic one poses some difficulties. In Kabyle, the tense set are 
phonetically long in open syllables when stressed, and shorter elsewhere; this 
encourages a phonetic rather than phonemic transcription into Arabic charac¬ 
ters. The lax vowel, however, is consistently short, so overtly representing length 
tends to create ambiguity with it. Moreover, word-finally, Arabic distinguishes a 
{alif mamdudah) from a {alif maqsurah, written with an undotted yd’) on purely 
morphological grounds not directly paralleled in Kabyle. 
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4.2 Observed solutions 

Table 1 sums up the observed responses to the most important of the issues in 4.1, 
by source (for the numerals under ‘Tense vowels’, see 4.2.2). Note that eastern 
Kabyie varieties, used in several of these sources, do not have a phoneme d. Only 
a limited selection of the Bible translation of 1904 was available. 

Table 1: Orthographic choices by source. 


dgz tsc bkgC w Tense a 

vowels 


Zaouias: 

KA21 (Gutova 2011) - £ ? 

Poem in honour of Aheddad (GEHIMAB^ £ j 

2006, 3) 

Aheddad wordlist (GEHIMAB 2006. 5, ^ j 

ATssani 2012) 

Customary law: 

Ait Ali Ouharzoune (Brugnatelli 2006, ^ j 

Anonymous 2009) 

Language teaching: 

Brosselard (1844) j 

Hanoteau (1867) *4 j 

Rinn (1887) - £ j 

Basset (1887) ^ j 

Bible translations: 

Anonymous (1833) - j j 

BFBS (1904) £ 

Early 20 th century: 

Boulifa (1904) i> j j 

Udifella (Yacine 1987) j j 


£ j o-“ 5 




<A il ? 


il - 3 

d li . 2 

£ £ ? 3 

*4 il j, - (various) 


*4 £ j, - 4 
*4 3 ? 3 


Educational materials: 

Tagamount (1995)- 
Belaid (1999) - text 
Belaid (1999) - lexicon 


m *4 £ 

6 3 ? 


Hrouch et al. (2003) £ 5 ° u 1 ' m ^ 3 

Ministry of Religion: 

Anonymous (2003), Tayeb (2012) £ j ^ g 

Folklore: 

Ferad (2003) £ j ^ *4 £ 


4 
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Other minor points of variation have been omitted from the table. These rarely 
relate to consonants, although one source - the first Bible translation (Anony¬ 
mous 1833) - makes the particularly eccentric choice to represent t with a new 
character seemingly based on Greek theta: a fa’ shape with a dot inside the loop 
(not in Unicode). However, earlier sources use the maghrihl script, while modern 
ones use naskhl ; this entails a difference in the dotting of fa’ (under in maghribi, 
over in naskhl). 

More commonly, differences relate to vocalisation. Initial geminates are often 
written with al-, particularly for coronal consonants. Some sources, such as the 
qanun of Ait Ali Ouharzoune, the poem in honour of Shaykh Aheddad, and the 
poetry of Udifella, frequently write word-final -en using tanwin (-an). Final -a is 
sometimes written with a or even with ta’ marbutah, and final -u with waw+alif 
The use of saddah to mark gemination is not always consistent. 

The textbook of Hrouch et al. (2003) stands out in a number of respects. 
While other sources normally transcribe initial epenthetic schwa using an alif, 
this textbook systematically omits it. Likewise, most sources write initial i and 
u with alif (usually followed by yd’ or waw), but this textbook uses ya’+hamza 
and waw+hamza. Almost all sources write clitics as part of the same orthographic 
word as their host, whereas this textbook separates them with a hyphen. In all 
these respects, and in others to be seen below, this textbook caiques the Latin 
script orthography that was dominant by the time of its publication. 


4.2.1 Consonants 

A majority of the consonants are shared between Arabic and Berber; as a result, 
most consonants are represented identically in all texts. The exceptions, however, 
are informative. 

The consonant d, where present, is overwhelmingly written as dad, whose 
pronunciation (except in careful speech) is the same and which has a far higher 
frequency in Arabic than da ’. The only sources to use da ’ consistently are the book 
of Belaid, a linguist teaching Arabic, and the Qur’an translation, whose authors 
had presumably been trained in recitation and hence in careful pronunciation of 
Classical Arabic phonemes; both would be expected to be unusually sensitive to 
the distinction in “proper” Arabic pronunciation between d and d. 

In sources from the 19 th century or earlier, g is normally represented by a kaf, 
with or without three extra dots. The choice to base the letter for g on k corresponds 
to traditions established in the region much earlier (cf. Ibn Khaldun (1969, 32), 
van den Boogert (1997)). However, a three-dotted qaf is overwhelmingly predo¬ 
minant in 20 th century sources, anticipated by the Bible translation of 1833 (but 
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not by Newman’s (1847) transcription from the same source!) and by Rinn (1887). 
This reflects the influence of bilingualism in Algerian Arabic, where the reflex of 
classical qaf is often g rather than q, and corresponds to normal modern Algerian 
practice in Standard Arabic when transcribing place names. 

No efforts are made to distinguish z from z until the late 20 th century - and 
even in those texts, actual usage is not entirely consistent. Hrouch et al.’s use of 
zay with a circumflex may be a rare instance of Belaid’s work influencing others, 
but may represent independent innovation; in either case, a possible source of 
inspiration at this late period is Sorani Kurdish, which uses g. and J. The Qur’an 
translation’s use of a zay shape with three dots follows in the footsteps of Chafik 
(1990) and Baamrani (2003) for Moroccan Berber, and takes advantage of this 
character’s presence in Unicode and in major digital fonts. 

Very few sources seem to have felt any need to distinguish ts from its usual 
underlying source tt. Of those that do. Belaid (who adds a small sin on top of the 
letter ta’) and the Qur’an translators (who use a cluster t+s) have already been 
observed to be particularly sensitive to minor details of pronunciation, while 
Anonymous (1833) may well have created the character (the letter shape of sin 
plus the two dots of ta’) in response to the requests of the American consul who 
commissioned him. 

The low textual frequency of c makes it difficult to generalise with confi¬ 
dence, but in texts predating the 1980s, it is normally represented as sin except in 
works produced for foreigners .The latter often (but not always) use jim with three 
dots, a letter borrowed directly from Ottoman Turkish. Among more recent works, 
Ferad and the Qur’an translators represent it as a cluster {t+s), again correspon¬ 
ding to common practice in Arabic transcriptions of place names in Algeria, while 
Hrouch et al. add a hacek to sin, probably calqued from the Latin orthography. 

The treatment of spirants is further complicated by the existence of variation 
within Kabyle; not all dialects necessarily have b, for example. In any case, we 
find that 19 th century materials consistently write ba’ in the positions where we 
might expect b. Most 20 th century sources do the same, but Udifella and the Qur’an 
translators both represent it as a ba’ shape with three dots below - it is unclear 
whether the latter got it from the former, but in view of the religious content of 
Udifella’s poetry it cannot be excluded. Belaid is strikingly inconsistent on this 
point (and the combination of qaf with three dots for g and fa’ with three dots for 
b that he uses in his sample text creates intolerable systematic ambiguity.) 

Spirant k and g are almost never distinguished from stops k and g. Belaid 
makes sporadic attempts to do so, but the only source systematically distingu¬ 
ishing them is the Qur’an translation. The novelty of the latter’s choice is clear 
from the signs chosen. The choice of jim with three dots for g can only reflect awa¬ 
reness of the Egyptian pronunciation of jim as g, an awareness that would have 
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been unlikely in earlier periods, while that of /cd/with a line on top for k likewise 
reflects the assumption that an Algerian audience will not be familiar with the 
usage of this letter for g, an assumption which is certainly correct at present but 
which would have been much less well-founded in the 19 th century, when some 
literate people would likely have been aware of Ottoman Turkish practice. 

There is little consistency in the representation of labiovelarisation even 
within a single text, but insofar as it is represented it is usually represented using 
a damma. Basset (1887) and Boulifa (1904) occasionally use a wdw instead. 


4.2.2 Vowels 

Due to the system incongruity discussed earlier, vowels are a much more conspi¬ 
cuous source of variation than consonants, despite being much fewer in number. 
The schwa is almost always written with a fatha, but for tense vowels there is 
considerable vacillation between short and long representations. A given word’s 
transcription may differ even within the same manuscript, but five main distri¬ 
butions may be identified, falling along a continuum from preference for short 
vowels (greatest in early and religious sources) to preference for long ones (grea¬ 
test in recent and secular ones): 

1. All vowels are written short, without exception (except sometimes in Arabic 
loans, which optionally retain their original orthography). This is observed 
in the Aheddad poem, e.g. <isufyit> issufyit ‘he got him out’, <anbi> 
ennbi ‘the prophet’ (an Arabic loan!), and in Tagamount’s (1995) dictionary. 
In this system, Arabic vowels are used purely for their quality, and no effort is 
made to identify length with tenseness. 

2. Most vowels are written short; in the rare exceptions (more or less common 

depending on the source), only one vowel per orthographic word is written long 
(excluding Arabic loans). This vowel is always tense, and normally in a penul¬ 
timate or final open syllable. This is observed in KA21, e.g. J <urimnieara> 
ur imnie am ‘it does not prevent’, <asakkudls> asekkud-is ‘his seeing’ 

(Gutova 2011, 56, 64); in the Aheddad wordlist, e.g. <uratnuddumara> 

ur tnuddum am ‘do not get sleepy’, jUiiij! <urtattisara> ur tettis ara ‘do not 
sleep’; in the Bible translations, e.g. jsjj' <innayas ayargaz> inna-yas ay 
argaz ‘he said to him: 0 man’ (Anonymous 1833, Lk. 12:14), <akkagi 

lawanaqaray> akkagi la-awen eqqarey ‘thus I say to you’ (British and Foreign 
Bible Society (1904), Lk. 15:10, quoted in British and Foreign Bible Society 
(1965)); and in Hanoteau (1867), e.g. <dakjufar> deg (i)jufar ‘in the lower 
clothing’ (p. 205), <dimulaban> d imulaben ‘are lizards’ (p. 179). In most 
examples the long vowel seems to be stressed, and hence is in fact phonetically 
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longer; this orthography thus indicates a sensitivity to the length distinction 
that is phonemic in Classical Arabic but merely phonetic in Kabyie. 

3. If a word contains any tense vowels, usually either one or two of them is 
written long (although, more or less frequently depending on the source, 
they may also all be written short). This is observed in Brosselard (1844), e.g. 

<tamasahut> tamasahut ‘story’ (p. 129), J <ulayar> ulayer ‘there 
is no point’ (p. 286); in the qanun of Ait Ali Ouharzoune, e.g. iyn y' <ayazld> 
ayazid ‘rooster’, <yaskldban> yesldddben ‘accusing of lying’, 

<azalis> azal-is ‘its value’; in Rinn (1887), e.g. jjjC <b’bwr> babur ‘Babor (place 
name)’ (p. 58), <s < Twdyw> s u'i'udiw ‘with a horse’ (p. 56); in Udifella’s 

poems, e.g. y^ <akyuyal> a-kyuyal ‘he will return you’, <attihudit> 

n teyhudit ‘of malice’, <yatnadiyas> yettnadi-yas ‘he seeks for him’; in 

the Qur’an translations, e.g. jj4l <lumur> lumur ‘affairs’, <timital> timital 
‘example’. In the qanun, unusually, even schwas may optionally be written 
long in final syllables: Q'j j) <yarwal> yerwel ‘he fled’. This practice too reflects 
phonetic reality to some extent, in that vowels before the main stress seem to 
have shorter duration. 

4. All tense vowels are normally written long, except sometimes in antepenulti¬ 

mate or earlier positions. This is observed in Boulifa (1904), in Ferad (2003), 
e.g. j' <’uhdiq> uhdiq ‘polite’ (p. 240), cuJI jli <tazalit> tazallit ‘prayer’ (p. 
257), £ <’adamiruh> ad-am iruh ‘it will go for you’ (p. 235), and in Belaid 
(1995), e.g. \ jU' <’lt(w)mwbyl> ettumubil ‘car’ (p. 166), 2~ .. nhL > :i 

<tmlylyt ’ntwtTyyn> timlilit n tutlayin ‘language contact’ (p. 216). In 
that most tense vowels are written as long, this probably reflects interference 
from colloquial Algerian Arabic, which has the same vowel system as Kabyie, 
but with schwa as the normal reflex of Classical short vowels, and tense 
vowels for Classical long vowels. 

5. All vowels are written using matres lectionis, even schwa (only in the text¬ 

book of Hrouch et al. (2003)). This practice can be explained neither in terms 
of Classical nor of Algerian Arabic; rather, like much else in this textbook, is 
obviously calqued from the Latin orthography which by that time was pre¬ 
dominant for Kabyie. Examples: <tixsi> ‘ewe’, > <’izem> izem ‘lion’. 

Basset (1887) cannot readily be placed in any one category, since he quotes texts 
from several different sources. 

In all but the last of these systems, schwa is written using a fathah, as is a in 
certain circumstances. This, to varying degrees, creates ambiguity between e and a. 
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5 Conclusions 

Kabyle has repeatedly been written in Arabic script since the pre-colonial period, 
particularly in the east. This practice was a natural outgrowth of religious edu¬ 
cation, and as such was never suppressed entirely despite difficult conditions. 
However, despite a history of at least three centuries, only those conventions 
imported directly from Arabic are stable; where orthographic choices must be 
made, those choices vary substantially from one source to the next. A few long¬ 
term trends are observable, in particular the replacement of /cd/with qaf as a basis 
for transcribing g and an increasing preference for matres lectionis over diacritics 
in representing vowels, but most of the variation is purely individual. A neces¬ 
sary precondition for this variation was the existence of significant divergences 
between Arabic and Kabyle phonology, notably greater than those observed for 
Shilha. While those divergences made the task of establishing a consistent ortho¬ 
graphy more difficult in principle, the fact that no one solution to these chal¬ 
lenges has ever been consistently adopted for long primarily reflects historical 
rather than linguistic facts. At no point has any one writer’s or school’s Arabic- 
script work been sufficiently widely read to be imitated, and all but the most 
prominent of one generation’s orthographic innovations have been forgotten by 
the next. This is to be expected given the repeated massive upheavals that have 
affected education in the region and the fact that teaching Kabyle has never been 
a primary goal of any of its educational systems, and is unlikely to change: while 
the belated introduction of Kabyle teaching after 1995 has provided a powerful 
new tool for standardisation, the general preference among today’s Kabyle-lan- 
guage writers and teachers for Latin makes it practically certain that this standar¬ 
disation will only affect the now-dominant, and already relatively stable, Latin 
orthography. 

This chapter’s conclusions are fairly solid insofar as they concern the print era; 
more data on political posters and album covers would be helpful, but appears 
unlikely to alter the overall picture. However, data on Kabyle manuscripts, parti¬ 
cularly those dating to the pre-colonial period, remains frustratingly scanty. There 
is no reason to believe that all relevant manuscripts are known, and almost all of 
those that have been reported in the literature are still unpublished. Any conclu¬ 
sions concerning manuscript practices are thus necessarily somewhat tentative, 
and should be tested against newly discovered or newly published manuscripts 
as they become available. 
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Alessandro Gori 

Beyond ‘agamf in Ethiopia: a short Note on 
an Arabic-lslamic Collection of Texts written 
in Ethiopian Script (fidal) 

Abstract: As in many other countries in Sub-Saharan Africa, Muslims in Ethiopia 
have produced a substantial amount of literature in their local languages, mostly 
using the Arabic script rather than the Ethiopic syllabary (fidal), which has been 
connected with the Ethiopian Christian Church and state for centuries. Recently, 
after fidal was adopted to write the language of the Harari Muslim people, a 
manuscript was written in which texts in Arabic and Old Harari were copied in 
Ethiopic script. I have analysed the manuscript in this paper, highlighting the 
strategies that the copyist followed in order to transcribe Arabic into fidal, and 
have attempted to place the work within the general framework of Islamic-Chris¬ 
tian relationships in Ethiopia. 


1 Fidal , Arabic and Islamic languages in Ethiopia: 
sketching a framework 

The usage of the Ethiopian script (fidal ) 1 to write Arabic and other ‘Islamic langu¬ 
ages’ of the Horn of Africa 2 is not completely unknown, but has so far been scar¬ 
cely documented and analysed. While it is actually a relatively rare phenomenon, 
it does deserve some attention, as its study could be of some interest for linguists 
(particularly sociolinguists) and more generally for researchers investigating the 
social and cultural history of the Muslims of the Horn of Africa. 


The data presented in this paper were compiled within the framework of and thanks to the re¬ 
search project ‘Islam in the Horn of Africa: a Comparative Literary Approach’ (IslHornAfr, Euro¬ 
pean Research Council Advanced Grant 322849). 


1 For a first introduction to Ethiopian script, see Frantsouzoff 2010. 

2 I use the concept of ‘Islamic languages’ here in a very loose way as a generic term to refer to 
languages spoken (also) by Muslims and with no intention to touch on the issues discussed by 
Bausani 1967,1978,1981, Brenner and Last 1985, Zappa 2004, 2009 and Gori 2015. 

3 Open Access. © 2019 Alessandro Gori, published by De Gruyter. |l")l!n!l2|3| This work is licensed 
under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. 
https://doi.org/10.1515/9783110639063-012 
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Some marginal paratexts among Christian Ethiopian manuscripts in Ga'az 
and Amharic have been found to include Arabic words, sentences and also short 
fully-fledged texts written in Ethiopian and/or in Arabic scripts. 3 Christian Ethi¬ 
opian astronomical and calendar treatises (e.g. the Bahra hassab, ‘The sea of 
calculation’), 4 contain many Arabic words and expressions. In the famous 16 th - 
century Christian treatise Anqasa amin (the ‘Door of the faith’), written in Ga'az, 
Arabic-Islamic religious terms are quoted and transcribed using fidal script. 5 
Also, the 20 ,h -century Amharic Christian Sylloges of sayh Zakaryas contain 
several words and phrases taken from Arabic. 6 During my recent examination 
of MS EMML 6239,1 identified a bilingual Arabic-Amharic version of sayh Zaka¬ 
ryas’ work written entirely in Ethiopian fidal. 7 Arabic-Islamic words are also 
scattered throughout Christian Ethiopian magical literature, where they func¬ 
tion as ‘abracadabra’ to enhance the mystical power of the texts. 8 The writing 
down of Islamic texts in Arabic or Amharic or their transliteration into one of 
these languages was usually carried out by or for the sake of Western scholars 
working in Ethiopia. 9 


3 Examples can be found among the manuscripts catalogued by the Hill Monastic Manuscript 
Library in Collegeville, MN, USA in the Ethiopian Manuscript Microfilm Library (EMML). See for 
example in MSS EMML 1056 fols 171b-174a (Ga‘az equivalent of Turkish and Arabic words); 1105 
fol. 129b (name of Mary in Arabic); 1187, fol. 73b and 74a (names of the months in Arabic); 1168 
2a (Arabic numerals); 1419 fol. 42b (Arabic names of the stars and of the months); 1373 fol. lb 
and recto of the last rear guard leaf (exercising in Arabic); 1599, last rear guard leaf (Arabic nu¬ 
merals); 2011, fol. 72a (Arabic prayer in Ethiopian letters); 2054, fol. 247a (note in Arabic and 
Ga‘az); 2436, fol. 160a (prayers in Arabic written in Ethiopian letters); 2531, fol. 175b (explanation 
of Arabic and Hebrew words); 2548, fol. 4a (name of Mary in various different languages); 2630, 
fol. 165b (name of Mary); 3127, fol. 174b (pen trial); 4036, fol. 4b (theological formula Arabic- 
Amharic); 4836, fol. 78b (unreadable note); 4909, fol. 145b and fol. 149a (stamp of nagus Mika’el 
of Wallo); and 4922 fol. lb (a holy image printed in some Arab country). 

4 Pietruschka/Amha Asfaw/Getatchew Haile 2003. 

5 The Anqasa amin is a polemical work written by Hnbaqom a learned Muslim man who conver¬ 
ted to Ethiopian Orthodox Christianity in the first half of the 16 th century and subsequently wrote 
a defence of the Christian faith also using verses from the Qur’an. The Ethiopic Anqasa amin 
text was edited (and translated into French) by Emeri van Donzel in 1969. For an analysis of the 
Arabic passages, see especially van Donzel 1969, 39-43. 

6 On the Sylloges of sayh Zakaryas, see Gori 2001 and Gori 2003. On the author, a learned Muslim 
man (c. 1845-1920), who converted to Christianity towards the end of the 19 th century, see Gori 2014a. 

7 I gave a presentation on this text at the Colloque Manuscrits Chretiens et islamiques d’Ethiopie 
(XHIe-XXe siecle) in Paris on Dec. 12 2014 (see Gori 2014b). 

8 A remarkable example of a magical prayer made up of passages of the Qur’an is discussed by 
Marcel Griaule (see Griaule 1930,109-110). 

9 For example, see the texts published by Enrico Cerulli in 1926; and the collection of Qur’anic 
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Moving from the Ethiopian Christian to the Ethiopian Islamic cultural land¬ 
scape, the existence of Arabic-lslamic texts written in fidal becomes less vague, 
at least during the modern era. A cultural and religious prejudice identifying 
Ethiopian script with Ethiopian Christianity may have hindered the usage of 
fidal among Muslims at an early stage. However, since the mid-20 th -century 
emergence of Islamic books printed in Amharic and Tigrinya, Ethiopian script 
has also begun to be used to write scattered words and phrases in Arabic. After 
the fall of the Socialist regime in 1991, a more or less religiously oriented Muslim 
book production began to blossom in Ethiopia, and fidal became more and more 
widespread among Amharic- and Tigrinya-speaking Muslims who make use of 
it to transliterate passages from the Qur’an, Hadith and other fundamental texts 
of the Islamic tradition for books and booklets published in local languages. 
The subsequent birth and diffusion of a kind of Islamic calligraphic practice in 
Ethiopian script came to confirm a fairly generalised acceptance of fidal among 
many Ethiopian Muslims. 

An especially remarkable position in the general picture I am sketching here 
is occupied by Harari, the Semitic language of the city of Harar, spoken by the 
ethnic group of the Hararis. Traditionally written in Arabic script since at least 
the beginning of the 18 th century, 10 Harari texts written in Ethiopian script have 
been found to have emerged at the end of the 19 th century, after the walled city 
was conquered by Manilak in 1887 and incorporated into the modern Ethiopian 
state. 11 The connection between the Ethiopian writing system and the Harari lan¬ 
guage became even tighter after the fall of the Imperial state (1974) and under the 
Socialist regime. Harari texts in Ethiopian alphabet were published abroad 12 and 


passages compiled by alaqa Tayya for Eugen Mittwoch in Berlin who published them in 1906 in 
Ethiopian script. 

10 For a general description of Harari literature in Arabic script, see Banti 2005, 2010. The lan¬ 
guage of the texts written in Arabic script has been labelled ‘Old Harari’, because it shows some 
quite substantial differences from modern Harari as spoken and written in Harar nowadays. 

11 See for example the texts written by the secretary of ras Makwannan for Casimir Mondon- 
Vidailhet and published by Carlo Conti Rossini in 1919 (see Wagner 2004, 355, note 5 with other 
examples). 

12 See for example the Suwar malasayacc taba (‘Voice of the revolutionary youth’), a handwrit¬ 
ten ideological journal produced by the Harari Students’ Association in Egypt ( Harari ardawigacc 
ahadannat mugad misrabe ). On a website entitled Everything Harar (http://www.everythingha- 
rar.com; last accessed 04/02/2018), which is run by Harari individuals from the Diaspora, I found 
two issues of the periodical (vol. 3 no. 2, fanuary 1980 and vol. 3 no. 3 [originally 2], February 
1980) which I was able to retrieve and download. The periodical was apparently produced in 
Egypt, but no information is available or traceable about its origin and periodicity. The content of 
the texts is extremely interesting from both a politico-cultural and from a purely linguistic point 
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in Ethiopia, 13 fostering the diffusion of fidal among Hararis (first and foremost 
among the intellectual elite). 

After the pro-Soviet regime collapsed in 1991, a relatively short period of 
incertitude followed: different options (Latin, Arabic, and Ethiopian scripts) for 
the writing of the Harari language were discussed, until eventually the choice 
officially fell on fidal. 1 ' 1 The appearance in 1992 of the first Harari-Amharic dic¬ 
tionary published entirely in fidal paved the way to further development of the 
connection between the Harari language and the Ethiopian script. 15 

Fidal is nowadays well established among Harari speakers and readers and 
has progressively become more deeply rooted in the writing and reading practi¬ 
ces and in the literary production of Hararis. At the same time, Latin script has 
not completely been entirely abandoned and still seems to be quite widespread 
among Hararis living outside Ethiopia, 16 and it is extensively used on the web. 17 

What was probably also instrumental in this impressive success of the Ethi¬ 
opian script among the Hararis was the diffusion of Amharic in the city of Harar 
and especially in the Harari diaspora. Numerically substantial and economically 
lively communities of Harari speakers can be found in Addis Ababa, Dire Dawa 
and many other cities of Ethiopia. These scattered groups of Hararis (in particular 
those living in the capital) are more and more exposed to Amharic, are practi¬ 
cally bilingual and tend to use written Amharic for their daily business and in 
connection with official and bureaucratic issues. The combined effect of the shift 


of view. In January 1986, the first issue of another magazine, Sahan (‘The ray’), was published 
in fidal in Rome by the Harargey tarih-wa add dafdafti mugad (‘Society for the research on Ha¬ 
rari history and customs’). This periodical was subsequently transferred to Toronto (see Wagner 
2003a, 6-7 and Wagner 2004, 355, note 6). 

13 One example is the magazine Aner (‘Make it beautiful’), published by the Harari Cultural As¬ 
sociation of Dire Dawa with an uncertain periodicity since 1988. According to Wagner 2003a, 1, 
the first two issues (Sept. 1988 and Oct. 1988) were written in Arabic script. On the internet portal 
Everything Harar , I found a later issue (probably issue no. 2,1990) produced in Ethiopic script. 

14 For the cultural (and political) background of this decision, see Gibb 1998, 256-7 and 266, 
note 18. The first issue of Harari taba, the official newspaper of the Harari National League, was 
published in Hadar 1984 CE (= November-December 1991). This event might be considered a 
sort of official endorsement of the adoption of the Ethiopian script by the officials of the Harari 
regional state. 

15 Abdurahman Mahamad Qorram 1984 CE (= 1992). 

16 It is estimated that one-third of the Harari people live outside Ethiopia. The most conspi¬ 
cuous Harari communities are now living in Australia, Canada and the USA (see Gibb 2002). 

17 It must be noted that, since the Roman keyboard is surely the most widespread on compu¬ 
ters, Amharic and Tigrinya written on the Internet often use a very simplified transliteration 
system based on the Latin script. This writing practice has not so far been studied by scholars of 
Ethiopian languages. 
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to fidal to write Harari and the wider and more intense exposure to Amharic has 
reduced the knowledge and usage of Arabic script among the Harari people and 
the knowledge of Arabic itself. 

As a matter of fact, while Arabic is known as a language of culture by a very 
restricted circle of learned men and is used as a spoken medium of communica¬ 
tion by Harari emigrants returning from jobs in the Gulf, the usage of Arabic script 
to write Harari seems to be vanishing completely. Moreover, the capability to read 
and understand traditional Old Harari texts written in Arabic script is apparently 
declining dramatically. Manuscripts in Arabic script (both in Arabic and in Old 
Harari language) are not copied anymore but are substituted by computer-typed 
books and electronic documents. The recent production of fidal reprints of the 
Kitab al-fara’id (‘The book of obligations’), probably the most famous piece of 
Harari literature, proves the sorely felt need for editions of traditional texts tar¬ 
geting the growing number of readers who are unable to read Arabic script. 18 It is 
this general sociolinguistic framework which forms the context for the origin of 
the manuscript I am going to deal with in the rest of this chapter. 


2 Arabic in fidal: the case of a Harar manuscript 

The manuscript I am about to discuss was photographed in Harar on 21 Septem¬ 
ber 2003 by Dr Simone Tarsitani (Durham University) 19 during an ethno-musi- 
cological research mission he was conducting. The item is basically a collection 
of different texts written down by the late Mr Abdi Abubakar Sufiyan 20 with a 
blue ballpoint pen (some parts are in red) on a personal organiser (15cm x 20cm) 
for the year 1990 of the Ethiopian Calendar (i.e. year 1997-1998 of the Gregorian 
calendar; the Ethiopian year starts on 11 September). The manuscript pages lack 
proper numeration, therefore in the following I will locate the texts according to 


18 For some general information about the Kitab al-fara’id, see Wagner 2005; the text has been 
published under the title Kitabul fara’id in Ethiopian script in Dire Dawa at Khalaf Mattamiya bet 
(i.e. printing press) s.d. The PDF of the text can be downloaded from. Everything Harar at http:// 
www.everythingharar.com (last accessed 05/02/2018). 

19 I am very grateful to Dr Tarsitani for providing me with the images of the manuscript and for 
authorising me to study them. I would also like to acknowledge his generosity in providing me 
with all the information in his possession about the origin of the manuscript. 

20 Mr Abdi Abubakar Sufiyan was actively involved in the devotional practices at the sanctuary 
of Aw Basor in Harar and made many efforts to keep the tradition of the Mawlid recitation alive for 
the benefit of the new generation. The origin of the present manuscript in fidal can be found in the 
context of Mr Abdi’s cultural and religious activities (personal communication from Dr Tarsitani). 
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the dates shown on the daily agenda (every single page is devoted to a different 
day, except for the weekends: Saturdays and Sundays share the same page). 

The texts are written almost continuously from the first day of the year (11 
September) until 6 April. The rest of the organiser is blank, except for the pages of 
4 September and 8 September (the pages in between these two dates have been 
torn out) and the page of the weekend 9-10 September (the last two days of the 
Ethiopian year). The manuscript contains texts in three languages written in Ethi¬ 
opian script: Old Harari, Arabic and Oromo. Besides, English in Latin script and 
Arabic in Arabic script (8 September, surat al-kahf, Qur’an 18, verses 1-10) are 
also copied in the agenda. 

The texts copied into the manuscript belong to what I have elsewhere 
referred to as the ‘Harari Mawlid collection’, 21 that is, the constellation of poetical 
and prose texts (mainly in Arabic, but interspersed with sustained wide sections 
in Old Harari) which is usually recited by Hararis to solemnise the feast of the 
birthday of the Prophet and on other important public and private occasions of 
a religious but sometimes also a secular nature. The collection appears in a wide 
diffusion of manuscripts (both relatively ancient and very recent) in Ethiopia. 

While no comprehensive analysis of the manuscript tradition of the ‘Harari 
Mawlid collection’ has as yet been carried out, the following data are available: 

In the library of the Institute of Ethiopian Studies at the University of Addis Ababa, the fol¬ 
lowing eight manuscripts contain the ‘Harari Mawlid collection’: 264:2, 273:3,1855:2, 2662:2- 
3, 2663:2, 2664:2 (incomplete), 2665:2, 2666:2 (see Gori and others 2014 under the relevant 
entries). The Abdallah Sharif private museum in Harar 22 keeps 16 codices of the ‘Harari 
Mawlid collection’: 23 1435 Abdulahi Collection 93 (not dated, but certainly 20 th century); 
1436 Abdulahi Collection 94 (not dated, but certainly 20 th century); 1437 Abdulahi Collec¬ 
tion 95 (not dated, but certainly post-1974); 1439 Abdulahi Collection 97 (not dated, pos¬ 
sibly 18 th century); 1440 Abdulahi Collection 98 (dated 1395/1975); 1441 Abdulahi Collection 
99 (not dated, but certainly post-1972); 1442 Abdulahi Collection 100 (not dated, possibly 
19 th century); 1443 Abdulahi Collection 101 (not dated, possibly 18 th century); 1527 Abdu¬ 
lahi Collection 185 (dated 29 May 1918); 1528 Abdulahi Collection 186 (dated 1142/1729-30); 
1529 Abdulahi Collection 187 (dated 9 July 1875); 1569 Abdulahi Collection 227 (not dated, 
possibly 20 th century); 1576 Abdulahi Collection 234 (not dated, possibly 19 th century); 1690 


21 See Gori 2010 for a first description of the structure of the collection. 

22 www.unesco.org/new/en/culture/themes/dynamic-content-single-view/news/opening_of_ 
the_permanent_exhibition_of_the_sherif_harar_city (last accessed, 06/02/2018). 

23 References are given according to the current provisional numbering used for PDF files re¬ 
alised in May 2011 by Jeremy Brown within the framework of the Ethiopic Manuscript Imaging 
Project directed by Professor Steve Delamarter, to whom I am very grateful for allowing me to 
study the items. 
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Abdulahi Collection 348 (not dated, but certainly before 1810); 1691 Abdulahi Collection 
349 (not dated, but 20 th century); 1692 Abdulahi Collection 350 (not dated, but 20 th century). 

The ‘Harari Mawlid collection’ has also been printed several times both abroad 
and in Ethiopia. 

In Egypt, the work has been printed twice: in 1350/1931 (127 pages) and in 1366/1947 (100 
pages), under the title Mawlid saraf al-'alamln (‘The birth of the honour of the universe’). 
Two editions published in Ethiopia (around 1992-1993 and in 2000) under the title Kitab 
‘unwan al-sarif bi-al-mawlid al-sarif (‘The book of the noble title on the noble birth’) are 
actually photomechanical reproductions of two manuscripts. The 1992 edition (160 pages) 
contains a manuscript dated 1412/1992-1993 and written by an anonymous copyist. The 
2000 edition (171 written pages) reproduces a manuscript dated 26 ramadan 1421/22 Decem¬ 
ber 2000 and written by the famous copyist Ibrahim Muhammad Wazlr. A third Ethiopian 
printed edition was published in Dire Dawa at an unknown date with the title Mawlid 
saraf al-‘dlamin (‘The birth of the honour of the universe’) at the expenses of Mahdi hagg 
‘Abdallah. It seems to be a reprint of one of the above-mentioned Egyptian books. Finally, 
in sa‘ban 1426/September 2005, a computer-typed text (175 pages) was circulated in Addis 
Ababa under the title Kitab mawlid saraf al-‘alamin (‘The book of the birth of the honour of 
the universe’) by the Harari scholar Abu Bakr Tabit (Sabit). 

This conspicuous number of testimonies proves the high esteem that the text coll¬ 
ection enjoys among Hararis both inside and outside the city of Harar; however, 
despite this, research on the origins and the first diffusion of this text collection 
is still scanty and insufficient. 24 

One of the most remarkable features of the ‘Harari Mawlid collection’ is that 
while it is structured according to a relatively stable general framework, the 
verses and prose sections which actually make up the textual constellation and 
the sequence in which they are put together show a high degree of variation from 
manuscript to manuscript and from book to book. 

Among the testimonies of the ‘Harari Mawlid collection’ I am aware of I was 
not able to identify a direct model for the manuscript I present here. 

Moreover, for the specific purposes of this chapter, I will only focus here on 
a number of main Arabic texts which I managed to identify, leaving to another 
occasion a full description of the unknown or unidentified Arabic texts and the 
analysis of the Oromo and Harari sections. 


24 In addition, the very famous and somewhat controversial Harari scholar Abdallah al-Harari 
(d. 2008) realised an abridged version of it ( muhtasar ) and published it at an unknown date in 
Beirut at the Dar al-masari‘ (the publishing house of the so-called Ahbash movement). 
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When lacking a well-established title, poetic texts are identified by their 
incipit; for cross-reference, I will use the September 2005 edition of the ‘Harari 
Mawlid collection’. 

Here is a list of the texts I managed to identify in the fiddl-Arabic part of the 
manuscript: 

6-8 October: poem Tufbi-hdni (‘Walk around my tavern’); 25 
8-9 October: prayer for the Prophet and sayh Abadir; incipit: Salatun wa-tasli- 
mun wa-azka tahiyyatin ‘ala Ahmad al-muhtar taha al-mafahir (‘Peace, greet¬ 
ing and best salutations to Ahmad, the chosen, the possessor of the glori¬ 
ous deeds’); tawassaltu bi-sayh al-Abadir (‘I ask the intercession of sayh 
Abadir’); 26 

10-14 October: poem Hadi al-‘is adrik ka’s al-hana’ (‘0 Camel Driver, get the 
chalice of happiness’); 27 

15 October, first part of the page: poem Hud yaminan (‘Take a right hand’) 
attributed to the renowned Yemenite mystic master ‘Abdallah Ibn ‘Alawl 
al-Haddad (1634-1720); 

- 21 October (second part of the page)-27 October: poem on different epi¬ 
thets of the Prophet along a model structure: - [Muhammad who is] sahib 
al-X [possessor of something, in an idafa construction], e.g. 1 st line: Sahib 
al-mu‘gizat al-qati‘at (‘The possessor of the irrefutable miracles’; 2 nd line: 
Sahib al-barahin al-sati‘a ‘Possessor of the clear proofs’); 28 

- 27-28 October: Asma’ allah al-husna (‘The beautiful names of God’); 

6-7 November: hymn in praise of the Prophet; incipit Ya wagha al-dini wa-al- 
karam (0 face of the religion and of the generosity); 


25 Mawlid edn 2005,12-13; attributed to ‘Abd al-Qadir al-Gilam. The text of the poem as preser¬ 
ved in the ‘Mawlid collection in Harar’ is different from that usually featured in ‘Abd al-Qadir’s 
Diwan (see e.g. al-GIlam s.d: 157-164). 

26 Mawlid edn 2005,124-5; sayh Abadir is the patron saint of Harar and the most revered holy 
man in the city (for a general introduction to his personage, see Wagner 2003b). A tahmis of the 
text is published in Wagner 1975, 47-53. 

27 Mawlid edn 2005, 84-93. Tastir (possibly produced by an unknown ‘Abdallah b. Ga‘far al- 
‘Alawi al-Yamani) based on the poem Inna lam‘a al-barq min hayfi mind (‘As the gleam of the 
lightning feared the desires ’) composed by the Sudanese author Muhammad b. Tahir al-Magdub 
(1842-1929), a member of the Burhaniyya brotherhood. The text copied into the manuscript 
under discussion extends to the verse: SafVu al-halqi fi hayratihim (‘Intercessor of the creatures 
in their bewilderment’). 

28 I am here transcribing the phrase as it should be according to the rules of Classical Arabic: the 
transliteration system of Arabic used in the text is specifically analysed in part 3 of this chapter. 
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7- 13 November: hymn in honour of the Prophet; incipit: Al-hamd lillah al-qa- 
dim/galla rabbi ‘an al-mital (‘Praise to God the Eternal, my Lord is beyond any 
similarity’); 

14-21 November: another section of the above-mentioned poem Hadi al-‘is 
adrik ka’s al-hana’ (‘0 Camel Driver, get the chalice of happiness’); from 
the verse man arada al-nagwa (‘the one who wants the upland’) to the end 
sayfuhu munsalitan ma wahana (‘His sword is drawn and he is not weak’); 
21-22 November: the qasida Banat Su‘ad (‘Su‘ad is gone’, also called al-Burda 
‘The Mantel’, like al-Busiri’s poem) by the famous Ka‘b b. Zuhayr (first a fierce 
opponent of Muhammad, then one his Companions) with anonymous tahmis 
Hanat Su‘ad (Su‘ad arrived’, up to the verse Hayfa’ muqlatin ‘agza’ mudbira- 
tin la yustaka qisarun minha wa-la tul (‘Thin appears when she comes and 
well-proportioned when she turns away, not too short nor too tall, no com¬ 
plaints about her height’); 29 

22 November-3 December: poem in honour of Abu Bakr (with tahmis); incipit: 
Ay a sa’ili ‘an madhi man gallafi al-dikr (‘0 the one who demands about the 
one who is honoured in the dikr’); 30 

4-5 December: poem in praise of ‘Abd al-Qadir al-GIlam; incipit: ‘Ala al-awl- 
iya’ alqaytu sirri wa-burhani (‘On the saints I have thrown my secret and my 
proof’); 31 

6-7 December: poem in praise of ‘Abd al-Qadir al-GIlani; incipit Yd qutb 
al-wugud bi-kum safa wugudi (‘0 pole of the existence through you my exist¬ 
ence is purified’); written in two columns; 32 

8- 9 December: invocation to God; each verse starts with one different letter 
of the Arabic alphabet in our manuscript reproduced in a very coarse hand; 
incipit: Ilahun wahidun munfaridun bi-datih (‘Only one God, alone in His 
essence’); 33 

10-13 December: poem in praise of the Prophet; incipit Salatun salamun 
ka-miski al-hitam (‘Prayer and greeting like the musk of the end’; attributed 
to ‘Abdallah Ibn ‘Alawi al-Haddad); 34 

13-15 December: poem in praise of the Prophet known from the Arabic-Ha- 
rari refrain: JVur salam nur salam (‘Light and greeting, light and greeting’); 


29 The bibliography about Ka‘b and his Banat Su‘ad is extremely wide: see at least the article 
Sells and Sells 1990. 

30 Mawlid edn 2005, 70-78. 

31 Mawlid edn 2005,100-101. 

32 Mawlid edn 2005, 94-95. 

33 Mawlid edn 2005, 98-99. 

34 Mawlid edn 2005, 79-82. 
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Arabic incipit: Sahidna bi-anna Allaha zakka Muhammada (‘We testify that 
God attested the truth of Muhammad’); 35 

15-18 December: poem in praise of the Prophet known from the Arabic-Harari 
refrain: Aw ayo salam aw ayo salam (‘Father, mother salute! Father, mother, 
salute!); Arabic incipit: Gamalun bada min fawq al-‘izzi dalla muhtagun 
fa-qala hazzi (‘A beauty appeared from above the high level a needy pointed 
out and said “Swing!”’); 36 

18-19 December: mystical poem; incipit: Bi-haqqi allah rigal allah (‘For God’s 
sake, o men of God’); 37 

- 20-22 December: invocation; incipit: Habii li minfadlikum habu li (‘Grant me 
from your grace, grant me!’); 

- 24-29 December: hymn in praise of the Prophet; opening formula: Say’ lillah 
rasul allah yd sayyidi habib allah al-madad rasul allah (‘0 dear to God, mes¬ 
senger of God! 0 my lord, beloved of God! Help, messenger of God!’); incipit: 
Abtadi’u bi-smi llahi madha hayri halqi llah (‘I commence in the name of God 
the praise of the best of God’s creatures’); 38 

- 4 September: prayer for the Prophet and sayh Abadir (the same as in 8-9 
October); incipit: Salatun wa-taslimun wa-azka tahyatin ‘ala Ahmad al-muhtar 
taha al-mafahir (‘Peace, greeting and best salutations to Ahmad, the chosen, 
the possessor of the glorious deeds’); tawassaltu bi-sayh al-Abadir (‘I ask the 
intercession of sayh Abadir’). 


3 Fidtil for Arabic: an easy adaptation? 

The relative proximity of the Arabic phonetic system to that of many of the 
Semitic languages of Ethiopia and Eritrea could generate the idea that the usage 
of fidal to write Arabic might be a relatively simple adaptation process. I believe 
that the following few observations on the way Arabic has been transcribed in the 
manuscript under analysis only partially confirm this idea. 

First of all, the linguistic interference represented in the manuscript is that 
between Arabic and Harari. To briefly highlight the main differences between the 


35 Mawlid edn 2005, 62. 

36 Mawlid edn 2005, 57-58. 

37 Mawlid edn 2005, 52-54. 

38 Mawlid edn 2005,121-24. 
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phonologies of these two languages. Wolf Leslau’s contribution on the Arabic 
loanwords in Harari is useful as a reliable reference: 39 

Harari, on the one hand, possesses four phonemes which are absent in 
Arabic: a voiceless palato-alveolar affricate /c/; an ‘emphatic’ (ejective, globa¬ 
lised) voiceless palato-alveolar affricate /c/; a palatal nasal /n/; and a voiced 
velar stop /g/. Arabic, on the other hand, has seven phonemes which do not exist 
in Harari: a voiceless (inter)dental fricative /t/; a voiced (inter)dental fricative 
/d/; an ‘emphatic’ (pharyngealised) voiced alveolar fricative /z/; an ‘emphatic’ 
(pharyngealised) voiceless alveolar fricative /s/; an ‘emphatic’ (pharyngealised) 
voiced dental stop /d/; a voiced velar fricative /g/; and a voiced pharyngeal fri¬ 
cative /*/. 

Moreover, there are four further important differences 1) the so-called 
‘emphatic’ consonants are pronounced as ejectives (globalised) in Harari, while 
they are basically pharyngealised in Arabic; 2) in Harari the voiceless velar fri¬ 
cative [h] appears only as an intervocalic variant of the voiceless velar stop /k/ 
as a consequence of spirantisation and thus has no phonemic status. In Arabic 
loanwords, /h/ passes to /k/ but is sometimes preserved; 3) Harari has a voiceless 
pharyngeal fricative /h/, but no voiceless glottal fricative /h/. Arabic /h/ is thus 
rendered as /h/ in Harari, but in some cases the original /h/ is preserved; and, 
finally, 4) the vocalic system of Harari is still an object of discussion among lingu¬ 
ists, but latest research points to the phonemic value of vowel quantity. 40 It thus 
seems that in terms of both vowel quantity and vowel quality the phonology of 
Harari can easily accommodate Arabic words. 

In the end, Leslau’s analysis clearly demonstrates that the passage of a rela¬ 
tively substantial amount of Arabic loanwords (about 300) into Harari vocabulary 
did not take place according to a stable correspondence among phonemes of the 
two languages. The inconsistent phonetic adaptation is apparent when conside¬ 
ring the way the Arabic phonemes (in particular those unknown to Harari) are 
rendered in Harari. 41 

It can be surmised that the picture is complicated by the fact that the Arabic 
loanwords’ mode of entry into Harari is twofold: 1) oral, through one of the forms 
of spoken Arabic to which the speakers of Harari are exposed; 42 and 2) written. 


39 See Leslau 1956, and Leslau 1957, which contains a comprehensive description of the diffe¬ 
rences between the Arabic phonological system and that of other Semitic languages of the Horn 
of Africa. 

40 For a detailed discussion on this, see Garad-Wagner 1998, 157-168, and for a more general 
description, see Wagner 1997, 487-488. 

41 For the details, see in particular the recapitulative table in Leslau 1956, 21. 

42 Leslau (1956, 22-23), indicates a number of Arabic dialects as possible sources for at least 
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through a learned milieu, especially that of Arabic teachers and Islamic scholars 
who are well acquainted with the Classical Arabic of the Qur’an and the theolo¬ 
gical tradition. 43 

In the case of the manuscript I am presenting here, the issue of the Arabic 
dialects remaining behind or underneath the words is not relevant. Nevertheless, 
the idea that an oral dimension of the texts has influenced their transcription 
cannot be excluded from these considerations. 

The main features of the transcription system used in the manuscript for 
depicting Arabic words in Ethiopian script can be briefly sketched as follows: 

- voiceless (inter)dental fricative /{/: written as li <s> for voiceless alveolar fri¬ 
cative /s/; e.g. ('llIT- for tubiitu (‘His immutability’); 

- voiced (interdental fricative /d/: written as H <z> for voiced alveolar fricative 
/z/; e.g. riHtih. for bi-datihi (‘By His essence’); 

‘emphatic’ (pharyngealised) voiced alveolar fricative /z/: written as H <z> for 
voiced alveolar fricative /z/; e.g. Hrhtt for zahara (‘It appeared’); 

‘emphatic’ (pharyngealised) voiceless alveolar fricative /s/: written as n <s> 
for voiceless alveolar fricative /s/; e.g. sidqun (‘His truthfulness’); 
‘emphatic’ (pharyngealised) voiced dental stop /d/: written as R <d> for (non- 
emphatic) voiced dental stop /d/; e.g. R.Vh'l for diya’un (‘Lights’); 

- voiced velar fricative /g/: written as 'll < h > for voiceless velar fricative /h/, 
not only in intervocalic position but also in initial position; e.g. 'iiTPI for 
ganiyyun (‘Rich’); 

- voiced pharyngeal fricative /‘/: written as h <’> for voiceless glottal stop /’/; 
e.g. hh.P1 for ‘aliyyun (‘Sublime’); 

- voiceless velar fricative /h/: written as ''ii <h> for voiceless velar fricative /h/, 
not only in intervocalic position but also in initial position; e.g. for 
haraqtu (‘I tore’); 

- voiceless pharyngeal fricative /h/ and voiceless glottal fricative /h/: mostly 
both written as >h <h> for voiceless pharyngeal fricative /h/; sometimes the 
Arabic voiceless glottal fricative /h/ is ‘correctly’ written as II <h> with the 


some of the Arabic loanwords in Harari, without being able to localise all of them in one single 
dialect. His conclusions are somewhat ambiguous. After remarking that ‘one is inclined to think 
that not one single Arabic dialect is to be taken as the source of Harari, but various Arabic dia¬ 
lects’, he says a few lines later: ‘As for the historically possible source of the Arabic loanwords in 
Harari, a South Arabic dialect seems to be more likely to be considered’. 

43 Leslau (1956) did not at all consider the possibility that at least some of the Arabic words 
he discusses could have entered Harari vocabulary from a learned, written source and not from 
a spoken variant of the language. An assessment of the loanwords taking this possibility into 
account might provide some fresh insights into the topic. 
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Ethiopian letter for the voiceless glottal fricative /h/; however the letter II 
<h> is sometimes also used for rendering the voiceless pharyngeal fricative 
/h/ and vice versa; the Ethiopian letter rh <h> sometimes also represents /h/; 
e.g. itih.un- for halim(u) {'Patient') and IIH. for hazzi (‘Swing!’); but also h.HO'i for 
ihsan (‘Doing good deeds’) and Hrhd- for zahara (‘He appeared’). 

The quality of the Arabic vowels /a/, /i/, /u/ is always reproduced accordingly in 
the text in fidal. As for quantity, only /a/ is sometimes rendered with the fourth 
order of the Ethiopian syllabary (<Ca>), which in the orthography of modern 
Harari also represents the long a (short /a/ written with the first order of the syl¬ 
labary (<Ca>), e.g. ''iirlTi-- for haraqtu (‘I tore’) and 'IMi.HT for wahidun (‘Unique’); 
however the fourth order long /a/ can also be used for an Arabic short /a/, e.g. 
Mdiri- for zahara (‘He appeared’). 44 As for Arabic /i/ and /!/, both are always written 
with the third order of the Ethiopian syllabary (<Ci>), as well as /u/ and /u/ being 
reproduced with the second order of the fidal (<Cu>). 

Diphthongs /aw/ and /ay/ are more or less consistently rendered in fidal (e.g. 
hd> for law [‘If’]), but many times the writing in fldal seems to point to an assimi¬ 
lated pronunciation of the /a/ to the following /w/ or /y/ e.g. V'lll-P" yowm for yawm 
(‘Day’), and heyr(u) for hayr(u) (‘Benefit’). 

It is worth noting that in many instances the word division does not cor¬ 
respond to the one used in Arabic; thus forms like ILHtiti. for bi-datihi (‘By His 
essence’) can be found together with r!.'IT,i(l.b i' ll Kin {falqalbil yowma ) for fa-qalbi 
al-yawma (‘Then my heart today’). 

To sum up the collected data, the rendering of the Arabic texts in fldal can 
be considered as being only partially accurate. Failure to represent many Arabic 
phonemes makes the transcription to a great extent imprecise; it is true that the 
Ethiopian syllabary lacks specific signs for some of the Arabic phonemes and for 
distinguishing the vowel quantity of /i/ and /u/, but no creative effort has been 
made by the copyist to fill the gaps in the writing system he was using and create 
a one-to-one sign set of transliteration. 

The written rendering in fldal of many of the Arabic sounds which are absent 
in Harari follows the same pattern as the phonetic adaptation of the Arabic loan¬ 
words into Harari. The word division in Ethiopian script is quite inconsistent and 
often only oddly corresponds to the Arabic original. The diphthongs are written 
in a way that hints at an assimilation of the short Arabic /a/ to the following semi 
consonant. 


44 See Wagner 2004, 358-359 for a discussion of the treatment of the vowels in the fidal ortho¬ 
graphy of Harari. 
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These elements possibly indicate that the writing process of the manuscript 
was carried out while hearing someone from Harar reciting the texts or while the 
copyist himself was reading them aloud. As a matter of fact, the writing down of 
the Arabic texts does not adhere to any written model: the copyist does not even 
try to rewrite in fidal a written Arabic Vorlage, but mostly transcribes the texts the 
way they would sound, if they were read by a Harari speaker. The copyist does not 
seem to have any intention to produce a well-thought-out system of translitera¬ 
tion of Arabic with the Ethiopian syllabary, he just aims at providing the faithful 
who are unfamiliar with Arabic script with a tool to access the texts and recite 
them on different religious occasions. 45 

To my knowledge this is so far the only known example of a substantial 
amount of complex Arabic texts written in Ethiopian syllabary. The general cul¬ 
tural trends in the Islamic communities in Ethiopia will tell us whether it will 
remain only an individual, isolated effort or the first step of a linguistic and liter¬ 
ary development among the Ethiopian Muslims. 
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al-FazazT(d. 626/1230) 242 
Ahundzade, Feth-Ali (Azerbaijani scholar) 
138 

Ali Shir Nawa’i (15 th century) 134 
al-MaghTlT 242 
al-Kansi 185 
al-Wartilani (1994) 279 
Anonymous 1833 (Si Hamet of Bejaia, 
marabout) 281, 288, 290 
Antal Gevay 150 
AsadTofTus 50 

BaladurT 40 
Basra 95 

Basset, Rene 280, 287, 290, 291 
Ben Badis 275 

Benton, Philip Askell (colonial officer) 246, 
251 

Bilawhar 66, 68 
Bogii Kagan 131 

Boulifa 279, 280, 282, 287, 290, 291 
BudTsaf 66, 68 
Bulan Nusatapi 229 

Cardinal Lavigerie, Catholic Archbishop of 
Algeria 281 

Chang Zhimei 196 

Cherif Kheddam 282 


Chuanchang Taiye ifiJScfcjflr Qutb ul-‘alam 
188 n. 28, 191 

Cisneros, Francisco Jimenez de 111, 115 

Daniel b. Azaryah 84 
Dutch Reverend Valentijn 227 
d’Hollone, Henri 183 + n.l8 

Elijah b. Zechariah 87 
Enver Pasha 138 + Fig. 3 
Evliya Celebi 256 n. 5 

Fakhr ul-DTn ‘Iraqi (d. 686/1287) 193 
Ferdinand II 111 
Fuzuli 134 

Galland, Antoine 112 

Gaza Sulayman al-GazzT (bishop) 96 

Hadj Boubekar 282 
Hamza al-lsbahanT 44 
Herrera, Pedro de 114 + n. 8 
Heyman, Johannes 143,144 Fig. 7 
Hillel b. Eli (court clerk) 78 
Hu Dengzhou 179 

Ibn al-Hajib 205 
Ibn Hamadush 277 

Ibn Ma'sum ‘Abd al-Rahman Ma Tianmin 
K 179,198, 200 Fig. 6, 201, 202, 207, 
210, 213 

Ibn Muqla (d. 940) 139 
Ibrahim Muhammad WazTr 303 
Ibrahim Muteferrika 152 
Imam Sifar Rijali 226-229 

Jaafar al-Bayati 76 
Judah b. Abraham 84 

Ka‘b b. Zuhayr 305 
Kangxi, Emperor J#gE 185 
Karaeng Pattingalloang (viceroy of Makassar) 
227 

Karakhanid, prince of Kashgar 132 


3 Open Access. © 2019 Dmitry Bondarev, Alessandro Gori, Lameen Souag, published by De Gruyter. 
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Kimelaha Salahak Abdul Kadir ibn 
Syahbuddin 231 
Krause, Gottlob Adolf 257-258 

Lavigerie, Charles Martial Allemand 281 
Laozi 185 

Leo Africanus (al-Hasan b. Muhammad 
al-Wazzan al-FasT) 253 
Le Roux, Jean Marie 244 n. 2 
Liu Jielian Sllt/ND 185 
Liu Zhi SlUK of Nanjing (d. 1745) 185 n. 22 
Lutfallah al-Kaydanl 202 

Ma Dexin Yusuf (d. 1874) 183 n. 19 

Ma Fuchu MW® 183 n. 19 
MahdT hagg‘Abdallah 303 
Ma Mingxin IjHfl'Ij (Wiqayatullah Muhammad 
Ja far) 184 + n. 20,185,186,191, 211, 
212 n. 56 

Ma Tianmin see Ibn Ma'sum 'Abd 

al-Rahman 

Ma Xiqing H#)# 198, 199 
Ma Xuezhi Muhammad Mansur 184 

+ n. 20 

Majira (member of the ruling family of 
Ternate) 224 

Mani (founder of the Manichaeism) 66 
Martin Luther 148 

Mas'ud III at Ghazna (Afghanistan) 41 
Mehmed Rasid 136 
Mehmed II see Sultan Mehmed II the 
Conqueror 
Manilak 299 

Mohamed Arezki Ferad 285 
Mondon-Vidailhet, Casimir 299 n. 11 
Moses Maimonides 87 
Mouloud Mammeri 283 
Mu Xianzhang (Hadrat Maulana 

al-A‘zam al-PinlianfuwT) 187 
Muhammad b. Tahir al-Magdub 304 n. 27 
Muhammad b. WasTf 41 
Muhammad ibn al-Haklm al-ZTnaml (?) 

al-Shandunl al-STnl 196 
Muhammad Ibn al-Walld al-TartusT 76 
Muhammad Mansur Ma Xuezhi SIMS' 184 
Muhammadu na Birnin Gwari 254 
Mulla ‘Isam al-DTn 204, 206, 207 Fig. 8 


Munif Pasha 138 
Murad III 149 
Mustafa Ali 143 

Najm al-DTn RazT 183 

Nissim b. Halfon (ll ,h -century Genizah trader) 
78 

Nurul-DTn Jam! (d. 898/1492) 193 
Nustapi/Nusatapi (Rijali’s lineage, see Rijali) 
226 

Palatics, Gyorgy 150 
Po Nachi 193 n. 33 

Prophet Muhammad 119,123,225 

Qasi Udifella 282, 287-289, 291 
Qianlong ^ IS, Emperor 185 
Qutb ul-‘alam (ChuanchangTaiye |p)@i;fk) 
191, 192 

ras Mak w annan for Casimir Mondon-Vidailhet 
299 n. 11 

Rashid al-DTn Fadl Allah’s (d. 1318) 183 n. 16 
Richardson, James (traveler) 243 
RudakT 66 

Rumphius, Georg Eberhard 224 

Saadiah Gaon 25, 81 
Sa'd al-DTn al-TaftazanT 204 
SadTqullah ibn Qamar 191 
SadTqullah of Xiji Malugou SMS 191 
Said Ali Cherif 277 

Sams-iQays 43, 45 n. 8, 48 + n. 13, 49 
Seyfullah Kasim el-Bagdadi 134 
Shaykh Aheddad 274,277,288 
Shaykh Ahmad al-TijanT 259 n. 16 
Shaykh El-Mokrani 274 
Shaykh Lmuhub Ulahbib 277 
Shaykh Oubelkassem 277 
Shaykh of Pingliang 186,187 
Shellabear, William Girdlestone 220 
Shisan Taiye 191 

Sidi Yahia ben Hamoudi 277 
Siyueba Taiye 191 

SlimaneAzem 282 
Sulayman al-GazzT see Gaza Sulayman 
al-GazzT 
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Sultan Hamza 227 + n. 6 
Sultan Mandar Syah 227 
Sultan Mehmed II the Conqueror 132,140 

Tengnagel, Sebastian 149 

UighurKhan 131 

‘Utman ibn Sa'Td al-Qutbl 250 

Uzun Hasan, Akkoyunlu prince 132 

Wali dan Masani (‘Umar b. Muhammad b. ‘Abd 
Allah b. Nuh al-Barnawi al-Kashnawi, or 
Dan Masani(h) 253-254 


Wang Hongxiang from Zhenjiang 

# 208 

Wiqayatullah Muhammad Ja'far see Ma 
Mingxin 

Wu Zunqi 183 n. 17 

Xiaoshan Wangye chilli® 189 n. 28 

Yakubu, son of Abdulahi Burja 239 
Ya'qub b. Layt 41 

Yusuf Khass Hacib of Balasagun 132 


Index of Polities, Ethnic Groups, Kingdoms, 
States, Towns, and Religious Denominations 


Aceh 218,222 
Addis Ababa 300,302,303 
Afghanistan 41, 44, 48, 50, 54, 56, 60, 64 
Ahvaz 63-65 

Ait Ourthilane (Algeria) 277 
Akkoyunlu 132 
Albania 151 

Algeria 16, 22,119, 244 n. 2, 273, 275-277, 
280, 281, 283,186, 289-291 
Almonacid de la Sierra 112 
Amazigh 283 

Ambon 223, 224, 228, 232 n. 9, 234 

Anatolia 132,139 

Arabic Melkite Church 94 

Aragon 23, 111, 112,114,115,119,120 n. 21,122 

Austria 145,149,151 

Ayyubids 73,76,87,90 

Baghdad 41, 76, 81, 132 
Balkh 48 

Bamiyan 50, 54, 56 
Banten 218 
Basra 95 
Bauchi 256 
Bejaia 277, 279, 281 
Biban region 282 
Bilad al-Sam 103-104 


Bima 222 

Borno 239-257, 262 
Bosnia 137,151,157 
Boudjelil 277,282 
Bukhara 48 
Byzantine orthodoxy 93 

Calanda 112 

Caribbean 241, 254 

Castile 23, 111, 112, 115,119,120 n. 21 

Chad 241, 242 Fig. 1,243, 253 

Chinese Turkestan 66 

Constantinople 93,119 

Cutar 112 

Dandan Uiliq 46, 61, 65 

Djemaa Saharidj 281 

Dire Dawa 300 + n. 15, 301 n. 18, 303 

Donggan 182 

Dongguan JKM Mosque in Xining 201 
Dongxiang ethnic group 182 n.14 

Fatimids 84 

Fatimid Egypt 25,76,85,89 
Fulani 239,240 

Toronkawa Fulani 240 
Fuqiang 185 
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Gansu province 182 + n. 14,183,184, 200, 
203 n. 50, 209 + n. 54 
Gaza 96 

Ghana 258, 259 +n. 16 
GhaznTn 48 
Gobir 253 

Goktiirk ‘Celestial Turkish’ confederation 131 
Gowa 223,224 
Ghur 44 

Granada 111, 112,114 
Guanghe 209 n. 54 

Harar 19, 295, 299 + n. 10 and 12, 300, 301 + 
n. 20, 302, 303, 304 n. 26, 310 
Hausaland 239, 240 
Hercegovina 151 

Hitu 215, 223, 224, 227 n. 6, 231 Fig. 3 
Hungary 150,151 

I bad! (areas of Libya and southern Tunisia) 
273, 274 

Iberian Peninsula 111, 120 n. 23 
Iran 20, 23, 39-41, 43-46, 49, 56, 59, 60, 
63-66, 69 

Istanbul 99,132,134,137 n. 4, 138, 145,151 

Jerusalem 93, 94 
Jesuits 182 n. 13 
Jinan (Shandong province) 197 
the Judaean Desert 99 

Kaifeng (Henan province, China) 208 + n. 53 
Kanem 239, 241, 243 

Kanem Sultanate 241 
Kanem-Borno 239, 243 
Kano 239, 240 + n.l, 254, 260, 262, 266 
Karakhanid dynasty 132 
Kashgar 132 
Katsina 239, 240, 253 
Kazakhstan 182 
Khotan 44, 54, 56, 61 
Koban (Qabul valley) 60 
KubrawT mystic 183 
Kufa 95 

Khorasan 48, 50 
Khuzistan 63 

Kongtongshan lll*|sjlL| 187 n. 26 


Kyrgyzstan 182 

Lake Chad 241, 243, 253 
Linxia 178, 182 n. 14, 198 n. 40, 200, 201, 
213 

Liujiapo 185 

Makassar 218, 222-224, 227 
Mali 239, 240, 264 
Mao (Chad) 241, 242 Fig. 1 
Mamluks 73, 76, 90 
Marv 48 
Melaka 218 

Melkite Church (Melkite Palestinian Church) 
93, 94,104 

Melkite Jerusalem Patriarchate 94 
Melkite monks 93 
Menhuan see TarTqa Jahriyya 
Mingera 183 
Mingxin 191 

Moluccas 16, 22, 217, 219, 222, 223, 231 
Mongol Empire 131 
Morocco 273 

Moriscos 16, 23,111-116,118,120,122,125, 
126 

Mozabite 273 

Mount Kongtong 187+n.26 
Mudejars 23, 111, 112-116,118, 

Mughals 20 

NaqshbandT (JahrT) 177,184 +n. 20 
Naqshbandiyya Sufi order 177,184 + n. 20 
Near East 81,133, 

Nigeria 24, 239, 241, 254, 255, 261, 
263-264, 

Ningxia province 179 + n. 6,191, 203 n. 50 
Niksar 133 

North Africa 81,133, 181 n. 9, 241, 259, 273, 
277 

Northwest China 16, 177, 179,193, 196, 203 
n. 49 

Novallas 112 

Ottoman Empire 21,131,133,140,145,151 
Ottomans 18 n. 23, 20,131, 132,139, 143, 
146, 150 
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Palestine 

South Palestine 94, 99,104 
Palestinian monks 93 
Pastrana 112 
Peres Biancs 275, 281 
Pingliang 186,187 + n. 26 and 27,190, 208 

Rum-Seljuk sultanate 132 

Sabinan 112 
Safavids 20 

Saffarid dynasty of Sistan 41 

Salonica 119,122 

Sarajevo 133 

Seleucid 44, 60, 63 

Seljuks 132,139,141,146 

Sevilla 112 

Samarkand 45, 66 

Shaanxi province 179 + n. 6,182 

Sinai Peninsula 94 

Sistan 41, 46, 49 

Slovakia 151 

Sokoto Caliphate 240, 254 
Songhay 259 
Songhay Empire 253 
Soummam valley 277 
Sous Valley (Morocco) 273 
Spain 81, 87, 111, 113-115,122, 124 
St Catherine’s monastery 94, 96, 98 
St Catherine Library 93, 94, 97, 98 
St Kariton monastery 93 
St Saba monastery 99 
Sudanic Africa 

Central Sudanic Africa 253 
Western Sudanic Africa 253,257 
Sumatra 218, 222, 229 
Syria 20 n. 25, 274 
Sufism 177, 189 n. 29, 274 

Tala Uzrar (near Bejaia) 277, 278 Fig. 1 
Tang-iAzao 60, 61, 62 n. 27 
TarTqa Jahriyya (menhuan) 184 n. 20,185, 
187, 191+ n. 31,192, 211, 

Tercan (Otlukbeli) 132 
Ternate 223, 224, 227 + n. 6, 228 
Tidore 223,228 
Tiimiouine (Ouzellaguen) 277 


Timurids 132 
Tizi-Ouzou 273, 281, 283 
Taoism 185,187+n. 26 
Toledo 115 
Tongxin area 179 n. 6 
Torrellas 112 
Transoxiana 48 
the Trengganu Stone 220 
Tunisia 119,173 
Turfan oasis 66 
Turkestan 66,131,132 
Turkey 

The Republic of Turkey 131,143, 310 n. 
45 

Turks 

West Oguz Turks 131-133,136,137 

Uighur Empire 131 
Uighur Khanate 131 
Urreadejalon 112,120 
Uzbekistan 182 

Valence 115 
Valencia 112,115 
Vienna 50,147,149 

Wangara /Wangarawa 240 + n. 1 

Young Turks 136 
Yunnan 183n. 19 

Xi’an 182, 186 
Xixiang U# Mosque 182 
Xining 178 n. 3,181 n. 10, 200 + n. 43, 201, 
213 

Zaouia (religious school) 25, 273-275, 277, 
279, 282, 287 
Zaria 254,258 
Zhenjiang 209 
Zhongwei 191 
Zoroastrian religion 40 
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Index of Languages and Scripts 


Abbasid script 

Early Abbasid Script 29, 97,101,102 
Ajami (also ‘agamf) 16 + n. 19,19,196 n. 39, 
237, 240, 243, 250, 253, 254, 258 
Eastern Fula Ajami 259 
imala type 250 
Harar 299 + n. 10 

Hausa 251, 252, 254, 255, 256-258, 
260-266 

KanembuAjami see Kanembu 
Kanuri Ajami 241,243-244,246,248, 
252, 254, 257, 265 
Zaria Ajami 254 
Algarabia 113 

Aljamfa 112-115,117-119,124 + n. 32,125, 
160 

Amharic 297-299, 300 + n. 17 
Arabic 11, 13,14-31, 39-69, 73-84, 88-90, 
93-105,111-126,132-133,134,136-137, 
139, 142,147,148,150-162, 168,177 
+ n.l, 178-214, 217-234, 237-239, 
244-255, 257-266, 273-291, 297-310 
Algerian Arabic 277, 286, 289, 291 
Andalusi Arabic 117 
Arabic Harari 305-306 
gilit (bedouin variety of Baghdad Arabic 
dialect) 76 

Classical Arabic 14 + n. 17,18 + n. 22, 

19, 57, 74, 79-83, 87-89, 96 n. 4, 104 n. 
21,117, 210, 274, 286, 288, 291, 304 n. 
28, 308 

manuscripts 22, 23, 93, 94, 95, 97, 98, 
100, 101,112,116 n. 10, 177, 183 n. 19, 
210, 255, 257 

Middle/Mixed Arabic 79, 95,103,104 + 
n. 21 

Modern Standard Arabic 95 
Old Christian Arabic 93, 97,104 
Pegon (Arabic in Javanese) 219 
qeltu (urban variety of Baghdad dialect) 
76 

Standard Arabic VII, 19, 27, 30, 88, 95, 
96 n. 4, 261, 273, 275, 289 


Arabo-Persian 16 n. 20,18, 20 n. 25, 25, 39, 
40, 43-47, 49-50, 53, 54, 56, 58, 59, 61, 
63-69,131, 137, 139,141, 143 
divani (Arabic dlwanf) 139,143,144 Fig. 7 
naskhi (or nesih in Turkish) script 29, 
99 + n. 10,102,139, 140 Fig. 4, 147, 288 
ruq‘a (or nk‘a in Turkish) script 139, 

140,143 

Austronesian languages 15 n. 18, 229, 232 
n. 9 

Malagasy 15 n. 18 

Malay 15 n. 18,16, 22, 25, 28, 217-234 
Sulu 15 n. 18 

Babylonian 73, 90 
Balochi 15n.l8 

Berber 15 n. 18,16, 22, 28,181 n. 9, 253, 
273-280, 283-285, 289 
Moroccan Berber 289 
Tamazight 276, 283-285 
Bosnian 151,157,160 
Brahmi script 217, 218, 229 

Castilian 21,112-115,117 
Caucasian languages 15 n. 18 
Central South Slavonic (CSS) 151,155,158, 
159, 161 

Chadian/Shuwa Arabic dialect 253 n. 5 
Chadic languages 253 
Chagatai (language) 19,134,137 
Chinese see Sino-Arabic 

manuscripts 177,178 n. 3,181 + n. 8, 
183 + n. 19,193,197,204 
calligraphy 19, 29,181 
dialect 178,183,206 
Croatian 147,148,150,151,152,156,160, 
162,163, 164 
Croato-Serbian 157 
Cyrillic 148,151,152,157,182 

Dagbane 259 + n.l6 

Dalmatian 155 

Divani see Arabo-Persian 

Dravidian language Mopiah 15 n. 18 
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Eastern Kalimantan (Malay dialect) 229 
Estrangelo see Syriac script 
Ethiopian script 19, 297, 298 n. 9, 299-300, 
300 n. 14, 301 n. 18, 302, 308, 309 
Arabic 297-298 + n. 3, 299 + n. 10, 300 
+ n. 13, 301-310 

fidal VII, 14,16,19 + n. 24, 297-301, 
304, 306, 309, 310 
Old Harari 297, 299 n. 10, 301-302 
Oromo 302, 303 

Fidal see Ethiopian script 
Fula 253, 257, 259, 262, 264 
Eastern Fula Ajami 259 
Fulani 239,240 

ge’az 298 n. 3 

German 18,147,148,150,152-153,154 n. 

6,155,156, 159, 161-164, 167-169, 173 
n. 12 

Alemannic dialect 6 n. 7 
Central Bavarian 163,173 n. 12 
East Middle German (Saxonian) 7 n. 7 
Middle High German 163,167 
Modern Standard German 167 
Low Middle German 6 
Old Frisian 6 

Upper German (Bavarian-Austrian) 6 n. 7 
West Upper German (Alemannic) 6 n. 7 
West Middle German (Franconian) 7 n. 7 
Glagolitic 148,152,157 
gilit dialect see Arabic 
Gobir language 253 

Greek 19 n. 23, 47, 77, 93, 94,103,131,151, 
161, 288 

Greek-Arabic 98,100 n. 12 
Gujarati see Indian languages 

Harari 299 + n. 10, 300-301, 305, 306-307. 
308 n. 42, 309-310 
Old Harari 297, 299, 301-302 
Hausa 16,19, 26, 28, 237-241, 245, 249 n. 4 
Standard Hausa 256,258,260,261 
Western Hausa dialects 260-261 
Hausa Ajami see Ajami 
Boko (Hausa in Latin script) 261 
Hanzi 181 n. 10,199 


Hungarian 18,19 n. 23,147,148,150,152, 
155, 156,158-160, 161-163, 163 n. 9, 
164,166 + n. 11,167-169, 173 n. 12 
Hebrew 14,16, 39, 49, 65, 71, 74, 77, 83-84, 
87, 88 

alphabet 58, 59, 61, 63, 65, 74, 75 + n. 5, 79, 
81-82, 89 

characters 16 n. 20,18, 60, 61, 62, 63, 75 n. 

5, 78, 80, 81, 88, 298 n. 3 
loanwords 60, 75 n. 5 

Indie languages 
Gujarati 218 

Iranian languages 15 n. 18, 46, 58-59 + n. 

21, 66 

Balochi 15 n. 18 
Iranian dialects 60 
Pashto 15 n. 18 
Kurdish 15 n. 18 

Saroni Kurdish (spoken in Iraq) 289 
Javanese 223, 229 
Javanese script 219 

Pegon (Javanese in Arabic script) 219 
jawi 16 +n. 18, 217-234 
Judaeo-Persian 44, 62 n. 27 
Judaeo-Persian orthography 58 
Judaeo-Persian texts 18, 44, 45, 46, 59-60, 
63, 65 

Judaeo-Arabic 8, 16 + n. 20, 18, 19, 20 n. 25, 
23, 25, 65, 73, 75 + n. 5, 77-78, 79, 81, 
82, 84, 87, 89, 90 

Classical Judaeo-Arabic 80, 82, 83, 88, 
89, 90 

Early Judaeo-Arabic 82 

Kabyle 16, 22, 25, 273-287, 289, 291, 292 
Kanembu 241, 242 Fig.l, 243, 244, 245, 246, 
250, 257 

Kanembu Ajami 244,247,248 
Kaidi Kanembu 245 
Ngaldoukou Kanembu 244, 246 
Kanuri 16,19, 26, 28, 235, 238-255, 257, 
259, 264-266 
Mowar Kanuri 246 
Yerwa Kanuri 246 

Kashmiri 15n.l8 
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khatt-isini see Sino-Arabic 
Kufic script 41 

Lanyin Mandarin (dialect in Linxia) 178 + n. 3 
Latin 6 n. 7, 15,18, 19 n. 23, 56,124 + n. 32, 
125, 147,148, 150, 151-153, 155-164, 
167-169 

Classical Latin 155,159 
Medieval Latin 155 
Neo-Latin 155 
Libyan 273 

maghribl script 99,100, 288 
Malay [language] 217-225, 227, 228 n. 7, 
229, 232-234 

Ambonese Malay, dialect 232 n. 9 
Old Malay 229, 232 
Kutai Malay 229 
Malay Jawi Script 16 
Malayalam 15 n. 18 

Mopiah (dialect) 15n.l8 
Mamprule 259 + n.l6 
Mandarin 178,181,199, 201, 202 n. 45, 203, 
207, 208, 

Mande(Jula) 240 

Manichean 18, 39, 44 n. 6, 45, 59 n. 21, 66, 
68,131 

documents 47, 59 n. 20, 68 n. 31, 69 
Manichaean New Persian 46, 47, 58, 59 
n. 21, 60, 66, 68 n. 31 
orthography 49, 59, 66-67 
Masoretic Syriac 20 n. 25 
Modern Hausa Boko spelling 262 
Montenegrin 151 
Mopiah see Malayalam 

Nabatean inscriptions 97 n. 6 
naskhl script 29, 99 + n. 10,102,139,147, 
288 

nasta‘lTq script 140 
Northern English 7 

Old Kanembu 22, 24, 26, 241, 242 Fig. 1, 
243-253, 265, 266 
Old Malay inscriptions 229 
Oromo see Ethiopian script 


Ottoman Turkish 8,15,16,19, 22, 27, 29, 

133,134, 137, 144,147-148,152,153, 
155-157, 160-162,164,168,169,181 n. 
9, 289, 290 

Pahlavi script 40 

Palestinian 16, 73, 90, 93, 94, 97, 98, 99, 
103 

Parthian (Iranian language) 66 

Pegon see Javanese 

Persian 

Afghani Persian 42 

Arabo-Persian see Arabo-Persian 

Classical Persian 8,56 

Early New Persian 42-45, 62 n. 26, 56 

Middle Persian 40, 44-46, 55, 64, 66, 

68 n. 31 

New Persian VII, 8, 16, 18, 22, 23, 40, 
41-44, 45 n. 7, 46, 47-49, 50, 54, 57, 
59, 64 

New Persian orthography in Arabic 
script 23, 25, 41, 43, 44, 45, 49, 
50, 57 + n. 18, 58 
New Persian texts in Syriac script 
39, 40 n. 1, 49, 58, 60, 65, 66 
Persian dialects 45, 48, 50 
Pinyin (transliteration system for Mandarin 
Chinese) 178, 182 n.13,194 n. 37,199, 
203 n. 48, 208 
Portuguese 223 

Romance language 111, 118,125,155 

Sardinian 155 
Scottish 7 

Serbian 19 n. 23,151,157,160 

Serbo-Croatian 151,157 

Sinaitic kufi / Sinaitic-Palestinian kufl 41, 97 

Sindhi 15n.l8 

Sino-Arabic script 177-204 

khatt-i Sin! 29,177,181,184 + n. 20, 
186,191,193,196, 202, 208-211 
xiaojing see xiaojing 

siyakat script (Ar. siyaqah) 26,141,142 Fig. 
6,143 

shikastah (Turkish §ikeste ) 140 
Sogdian 46,58,66 
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Sogdian script 131 
Songhay 240,255,259 

Zarma Songhay (Songhay-Zarma) 255 
Soninke 240,253,265 
Spanish (Castilian and Aragonese) 21, 111, 
112,115,116,120,121,122, 124-126, 223 
Swahili 16n.l8 
Syriac 20 n. 25, 39, 49, 58 
Syriac script 49,58,60,65,66, 
estrangelo 97 + n. 6 
Syriac serta script 97+n.6 
Syrian 102 
Syrian dialect 103 

ta'liq script 140,141 Fig. 5 
Tamazight see Berber 
Tamil 8, 15 n. 18 

Standard spoken Tamil 8 
Tarjumo see Old Kanembu 
Tifinagh 283-285 


Keywords 

Authority 17, 20-22 
Ahvaz legal document 65 
Alphabet (definition) 10-11 
Arabic Meikite Church 93, 94,104 

Bible 25, 60, 65, 74, 77, 81, 87, 89, 93, 94 + 
n. 2, 281, 287, 288, 290 
Bible fragments 77 

Bilingualism 49, 56, 66, 98,100 n. 12,193, 
202, 284, 289, 298, 300 
Bookmaking 3 

Cairo Genizah 23,77-78,89 
Calligraphy 19, 29, 136 n. 4, 146, 226 
Chinese 181 

catechism 50, 59, 68 n. 31,178-179, 197,199 
Fig. 6, 200-201, 210 
Islamic catechism (‘AqTda) 198 + n. 40 
Christian 14,16,19, 23, 40 n. 2, 65, 74-77, 
93-104, 111, 113,122, 124,126,131,148, 


Neo-Tifinagh 283 
Tigrinya 299, 300 n. 17 
thuluth script (stilus) 139 
Turkish 289, 290, 310, 8, 15, 16,18 n. 23, 19, 
21, 28, 58,131-139,144,145 
Azeri Turkish 19,134 
Standard Turkish (Stand. Trk) 133,156, 
157 

Western Rumelian dialects (Balkans) 156 

Uighur script 131,132,137 
Urdu 15-16 n. 18 

Wolof 181 n. 9, 253 

xiaojing 25,177-214 

Yiddish 77 

qeltu type see Arabic 


150, 217, 224, 264, 281, 297, 298 + n. 5 
and 6, 299 

Christian Arabic manuscripts 18, 22, 93-105 
Codicology 3 

‘Comparative standardology’ 3 n. 2, 5-7, 9, 
16,17, 30 

Contact VII, 6-7,14,16,17-19, 27, 29, 30, 
87, 223, 238-239, 252, 291 
Copyists 1, 22, 30, 40, 95, 115, 124, 133 

De-standardisation 6-7, 9 n. 12, 78, 89 
Dialect 4, 6 n. 7, 7 n. 8,15 n. 18, 26, 44, 45, 
46, 48-50, 55, 57, 59 n. 19, 60, 76, 78, 
82, 83,103,104,117,132, 151, 156, 163 
n. 9,167, 173 n. 12, 177 n. 1,178 + n. 3, 
183-184, 200, 201, 203, 206, 208, 213 
+ n. 57, 222, 229, 232 + n. 9, 234, 241, 
243, 244, 246, 253 n. 5, 256, 259-260, 
261, 262, 276, 277, 285, 286, 289, 307 n 
42, 308 n. 42 
Diglossia/diglossic 7 
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Indices 


Domains (of manuscript production, of 

language, of culture) VII—VII1,1-2, 3, 6, 
12,13, 14-18, 22, 27, 30, 31, 159, 238, 
241, 276 

Education 1, 22, 24, 73, 74, 81, 83, 84, 87, 
89-90, 133,136, 143,150,177,179, 182, 
217, 221, 241, 273-276, 282, 284, 287, 
292 

facsimile manuscript 183 n. 19,184 n. 20, 

195 + Fig. 4, 196,197 Fig. 5,198 + n. 40, 
208, 214 n. 58 

Factors of standardisation 2,16-17, 22 

frequency (of linguistic items) 27, 88,158, 
237, 246, 247, 252, 266, 288, 289 
format VIII, 1, 2,14-15, 29, 30,134,146, 
177-178, 190,193,198 n. 40, 202, 203, 
204, 206, 210-212, 214 

gargam/girgam (a genre of historical 

chronicles of Borno) 243, 246, 247, 250, 
252 

genre 8, 14.15, 17, 21, 24-26, 73, 82, 88, 90, 
143, 145, 238, 243 

glossary 59 n. 20, 117,183 n. 19, 204, 206, 
207 Fig. 8, 208 + n. 53, 283 
glosses 181,183 + n. 19,184,189,190, 191, 
193, 195-197, 202, 206, 209-212, 240, 
247, 252-255, 257, 258, 260-263 
glottal stop 42, 53, 57,103, 220, 232, 
261-262, 308 

Gospel 65, 95 n. 2, 99-100 
grammar 6, 74, 81, 96 n. 4,103,104,136,178, 
182 n. 12,183 n. 19,195-196,197 Fig. 5, 
198 n. 40, 204, 210, 233, 274, 280, 
graph (definition) 11 
grapheme (definition) 11 
grapheme-phoneme combinations 237, 266 
graphemic conservatism 244 

Hebraisation 73, 87, 89 

high-frequency items 235,266 

historical sociolinguistics 3 n. 3, 4, 5, 8 n. 10 


identity 4, 17, 20, 21, 22-24 + n. 27, 26, 

74-75, 78, 89, 93-94, 115,126, 275, 283 
imala type 250 

inscriptions 40-41, 60-61, 74, 97 n. 6,131, 
139,143, 220, 229 
Old Malay inscriptions 229 

Jews 19, 65, 73, 74, 75 + n. 5, 76-77, 87-90, 95 

language codification 4,5,6,8,24, 
language change 4, 9 n. 11,133,155 
language (as factor of standardisation) 26-28 
language variation 2, 4, 5, 7, 26, 78,178, 256, 
289, 

layout VIII, 1, 3,12-15, 21, 26, 29, 30-31, 
93-94, 101-102,177-178, 190,192, 210, 
211, 222, 228, 241 
letter (definition) 11 

linguistics 2, 3 + n. 2, 4, 5-6, 8,11,15, 26, 
100 n. 15 

literacy studies 2,5,9 

literacy in Ajami 237, 241, 243, 252, 254, 

259, 260, 261, 263 

literature 4,16, 20, 30, 40, 60, 66, 74, 77, 

94, 97 n. 4, 99, 104 n. 21, 120 + n. 23, 
124, 126, 132,133,151,180, 253, 279, 
292, 297, 298, 299 n. 10, 301 
liturgical text 29,211,212 

madrasa education 177,179,183,196,198 n. 

40, 200, 203 n. 50, 204, 211 
Manuscript production 13-18, 21, 29, 30 
Manuscripts 

Multiple-text manuscript 119,140 Fig. 4,144 
Fig. 7,147-148,168, 

Medium (as factor of standardisation) 15,17, 
28-29 

Melkite monks 93 
Middle Ages 122,124 
Mishnah 77 
Monastic collections 93 
Monastic scriptoria 97 

Network 17, 20 + n. 25, 21-23, 26, 77-79, 87, 
89-90, 218, 222, 274 
neologisms 276, 283 
New Literacy Studies 5 
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New Testament 95 n. 2 

Non-literary Arabo-Persian documents 44 

Old Testament 98 

oral transmission 1, 81, 94 n. 2, 257, 285, 

307, 308, 310 n. 45 

orthography (definition) 8-9 + n. 12,10 
orthographic / graphemic choices 235, 265, 
266, 287, 290, 292 

orthographic /graphemic conservatism 244 
orthographic / graphemic innovation 235, 
244, 245 

orthographic / graphemic variation 10, 27, 31, 
238, 239, 264, 

Pact of‘Umar 74, 76 + n. 7 

palaeography 3,11,14 n. 17, 97, 217, 219, 222 

philology 3 + n.3,177 

phone (definition) 11 

phoneme (definition) 11 

phonetic orthography 26, 73, 79, 83, 89 

poetry 6 n.7, 25, 41 + n. 5, 44, 50, 57, 66, 

74, 77, 140,148,186 + n. 24, 187-189, 
191-192, 210, 274, 276, 281-285, 
287-289 

prayer 96,121, 126,148, 152, 177, 178, 198, 
200 n. 44, 202, 206, 209 + n. 54, 214 
Fig. 10 + n. 58, 263, 291, 298 n. 3 and 8, 
304, 305, 306, 310 n. 45 
Pronunciation 9 n. 11, 20 n. 25, 27, 45 n. 8, 

46 n. 9, 49, 50, 56-57, 59 n. 19, 80, 82, 
88,114,133, 155, 159,160 n. 8,161,167, 
168,178,180, 199-201, 203,219, 222, 
260, 261, 262, 280, 288, 289, 309, 310 
n. 45 

Proverb 184,188-189 + n. 28, 213, 285 
Psalter 49, 66, 98 

qasTda 50, 66-68 + n. 32, 211, 242 Fig.l, 

250, 259 n. 16, 305 

Qur’an 2,14,18, 74, 94.95, 99, 104, 119,123, 
139,181, 203, 209-212, 217, 219, 237, 
241, 244, 247, 248, 251, 252, 274, 284, 
288, 289, 291, 298 n. 5 and n. 8, 299, 
308 

Qur’anic manuscripts 243, 246, 247, 250, 252 


sacred biography 184 
salience (of phonetic features) 27 
scribal schools 22,23,81,83,90,97 
scribe 1,10,13,17,19 n. 23, 20, 22-28, 68, 
77, 78 + n. 8, 83, 84, 87, 89, 95, 100, 

102, 115,134,137,147,152-154, 156, 157, 
158, 159,163,164 n. 10,168,178,189, 
191 + n. 31,196, 209, 210, 211, 222, 232, 
240, 243, 244, 245, 249, 251, 252, 254, 
260, 261, 263, 264, 265 
script (definition) 10-11 
script style (definition) 11 
script type (definition) 11 
Sino-Arabic manuscripts 173-214 

sociolinguistics 2, 3 + n. 3, 4 + n. 5, 5, 8 n. 

10, 9, 17 n. 21, 20, 74 

speaker 25, 26, 74 + n. 1, 77, 89, 104,114, 
152, 160 n. 8, 221, 222, 241, 243, 244 
n. 2, 255, 256, 261, 262, 276, 279-280, 
300, 307, 310 
spelling: 

spelling (definition) 10-11 
etymological spelling 238,246,266 
historical spelling 19,60,260 
spelling conventions 1, 8, 9 n. 13, 

18-19, 22, 25, 79, 81, 88, 243 
spellingvariation 8, 26-28, 158, 245, 
246, 247, 287, 290, 

spoken language 5, 8 + n. 10, 26, 30, 4583, 
193 n. 19,114,133,153,166,178, 201 
stabilisation 1,18, 21, 25, 26, 28,116,145, 
246, 251, 292 

stable (sub)sets of grapheme-phoneme 

correspondences 27, 31,153, 265, 266 
‘standard language culture’ 6, 7 
Standards of manuscript forms, layout 
and script 1,12-15, 29-31, 93-104, 
210-212, 228 

Standardisation of speech / spoken language 
5, 7, 8, 30 

Standardisation of writing 7, 8, 9,12, 74, 81, 
84, 88-90, 220, 

subsystems of standardisation (see also 
(sub)stabie sets of grapheme-phoneme 
correspondences) 27, 31,153, 265, 266 
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Indices 


Tafsir 64 + n. 29, 65, 211 
takhmls style 241 
Talmud 77 

textual transmission 24, 25, 40, 66,160, 
192, 211, 218 

Token (definition) (also see Type and token) 
11-13 + n. 15 

transcription 11, 26, 54 n. 16, 75 n. 5,152, 
156,178, 182 n. 12,183 n. 16 and n. 19, 
184 + n. 21, 185,190, 193,194, 196,198 
n. 40,199, 200, 203, 204, 206, 211, 213, 
214 n. 58, 254-257, 259-264, 273, 277, 
279-280, 282-284, 286, 289, 290, 308, 
309, 310 n. 45 

transcription system 75,152,156,178,182 
n.12,183 n.19,185, 199, 206, 254, 257, 
259-264, 273, 279-280, 282-283, 286, 
290, 308, 310 n.45. 
transition from manuscript to print 30 
translations 60, 65, 81, 94 + n. 2, 95 n. 2, 
120,122,124,125, 151, 184 n. 19,193 +n. 
33,194, 198 n. 40, 202-204, 206, 208, 
210, 211, 214, 243, 277, 279, 283, 284, 
285, 287, 290, 291 


type (definition) (also see Type and token) 12 
type and token 12-13 + n. 14 
type and style of script 11,12 

uniformity VII-VIII, 21, 29, 31, 237-238, 252, 
264 

vocalization 53, 56, 255, 257 

Warsh tradition of Qur’anic spelling 
250-251, 257 

written language 1, 2, 4, 6, 7,10, 21, 25, 30, 
40, 73, 75, 76, 78, 97, 104 n. 21, 114, 132, 
153, 169, 252-254 

Written language standardisation (see 
Standardisation ofwriting) 
writing system (definition) 11-12 
written traditions of Iran 39, 60, 69 

zaouia (religious school) 25, 273-275, 277, 
279, 282, 287 



